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(54) Title: COMPOSITIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF LUNG CANCER 

(57) Abstract: Compositions and methods for the therapy and diagnosis of cancer, particularly lung cancer, are disclosed. Ulustra- 
live compositions comprise one or more lung tumor polypeptides, immunogenic portions thereof, polynucleotides that encode such 
polypeptides, antigen presenting cell that expresses such polypeptides, and T cells that are specific for cells expressing such polypep- 
tides. The disclosed compositions are useful, for example, in the diagnosis, prevention and/or treatment of diseases, particularly lung 
cancer. 
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COMPOSITIONS AM) METHODS FOR THE THERAPY 
AND DIAGNOSIS OF LUNG CANCER 



TECHNICAL FIELD OF THE INVENTION 
5 The present invention relates generally to therapy and diagnosis of 

cancer, such as lung cancer. The invention is more specifically related to polypeptides, 
comprising at least a portion of a lung tumor protein, and to polynucleotides encoding 
such polypeptides. Such polypeptides and polynucleotides are useful in phamiaceutical 
compositions, e.g., vaccines, and other compositions for the diagnosis and treatment of 
10 lung cancer. 

BACKGROUND OF THE INVENTION 

Cancer is a significant health problem throughout the world. Ahhou^ 
advances have been made in detection attd therapy of cancer, no vaccme or other 
universally successfiil method for prevention and/or treatment is currmtly avaflable. 

15 Current therapies, which are generally based on a combination of chemotherapy or 
surgery and radiation, continue to prove inadequate in many patients. 

Lung cancer is the primary cause of cancer death among both men and 
women in the U.S., with an estimated 172.000 new cases bemg reported in 1994. The 
five-year survival rate among all lung cancer patients, regardless of the stage of disease 

20 at diagnosis, is only 13%. This contrasts with a five-year survival rate of 46% among 
cases detected while the disease is still localized. However, only 16% of lung cancers 
are discovered before the disease has spread. 

In spite of considerable research into therapies for these and other 
cancers, lung cancer remains difficult to diagnose and treat effectively. Accordingly, 

25 there is a need in the art for improved methods for detecting and treating such cancers. 
The present invention fiilfills these needs and fiarlher provides other related advantages. 
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SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID N0s:l-3, 6-8, 10-13, 15-27, 29, 
5 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107- 
109, 111, 113, 125, 127. 128, 129. 131-133, 142, 144, 148-151, 153, 154, 157, 158, 
160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 
217, 220-224, 253-337, 345. 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 
431, 434, 442, 447, 450 and 467; 

10 (b) complements of the sequences provided in SEQ IDN0s:l-3, 6-8, 

10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 
84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 
210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 

15 420, 424, 428, 431, 434, 442, 447, 450 and 467; 

(c) sequences consisting of at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 
75 and 100 contiguous residues of a sequence provided m SEQ ID N0s:l-3, 6-8, 10-13, 
15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86- 
96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144. 148-151, 153, 154, 157, 

20 158, 160, 167, 168, 171, 179. 182. 184-186. 188-191, 193, 194, 198-207, 209, 210, 213, 
214, 217, 220-224, 253-337, 345, 347, 349, 358. 362. 364, 365, 368, 370-375, 420, 424, 
428, 43 1 , 434, 442, 447, 450 and 467; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
N0s:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 

25 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 
148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 
198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 
368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, under moderate or 
highly stringent conditions; 
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my. 9,./ „1 -H ^ " 96./.. 

m • 86-^ 107-10, 

H .5, ,5S. ,60, „s, ,,,, ■ 2 2'^'^^ 

'•ID, 424, 428, 431, 434, 442, 447, 450 and 467. 

.3 _ exp^ea i„ a. ,ea. a,«« 20-^ ^ , ,1 J^l^^ 

and .0. i„ a. a^ 50% „,ta, ^^^s .«ed. a. a Z12 

abou.lO-f«ldhigtea,anthatfe„™„ala=»««. 

20 V ^ " """^ P-^viO^^ polypeptide 

sequence described above. POiynucleotide 

The pi^ent invention further provides polypeptide compositions 
con^ns^g an an^o acid sequ.ce selected ^r. the group consisting of 
recited in SEO ID NOs- H9 sequences 

25 344 346 35^57 ^j j^ ' ''''''''''''' ^ 

344. 346. 350. 357, 361. 363. 365. 367, 369. 376-382. 387-419 423 427 4.n 4., 

441. 443. 446. 449 and 451-466. . ' 

In certain preferred embodiments, the polypeptides and/or 
polynucleotides of the present invent.-. • P^JPeptides and/or 

... immunogenic, /.«.. they are canable of 

30 :nL:r:^~~---'— "--c:: 
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The present invention further provides fragments, variants and/or 
derivatives of the disclosed polypeptide and/or polynucleotide sequences, wherein the 
fragments, variants and/or derivatives preferably have a level of immunogenic activity 
of at least about 50%, preferably at least about 70% and more preferably at least about 
5 90% of the level of immunogenic activity of a polypeptide sequence set forfli in SEQ ID 
NOs:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226-252, 338-344, 346, 350, 
357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449 
and 451-466, or a polypeptide sequence encoded by a polynucleotide sequence set forth 
m SEQ ID NOs:K3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 

10 71, 73, 74, 77, 78, 80-82, 84, 86^96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 
142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 
193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 
364, 365, 368, 370-375, 420, 424, 428^ 431, 434, 442, 447, 450 and 467. 

The present invention further provides polynucleotides that encode a 

15 polypeptide described above, expression vectors comprising such polynucleotides and 
host cells transformed or transfected with such expression vectors. 

Within other aspects, the present invention provides pharmaceutical 
compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 

20 Within a related aspect of the present invention, the pharmaceutical 

compositions, e,g.^ vaccine compositions, are provided for prophylactic or therapeutic 
applications. Such compositions generally comprise an immunogenic polypeptide or 
polynucleotide of the invention and an immunostimulant, such as an adjuvant. 

The present invention further provides pharmaceutical compositions that 

25 comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a polypeptide of the present invention, or a fragment thereof; and (b) a physiologically 
acceptable carrier 

Within further aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
30 described above and (b) a pharmaceutically accqjtable carrier or excipient Illustrative 
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antigen presenting ceUs include dendritic ceUs. macrophages, monocytes, fibroblasts 
and B cells. 

Withm related aspects, phaimaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above 
5 and (b) an immunostimulant. 

The present invention further provides, in other aspects, fusion proteins 
that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the fonn of phannaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable cairier and/or an 

10 immunostimulant. Hie fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
one or more polypeptide segments for facilitating the expression, purification and/or 
immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for 

15 stimulating an immune response in a patient, preferably a T cell response in a human 

patient, comprising administering a phannaceutical composition described herein. The 
patient may be afflicted with lung cancer, in v.hich case the methods provide treatment 
for the disease, or patient considered at risk for such a disease may be treated 
prophylactically. 

aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
patient a phamiaceutical composition as recited above. The patient may be afflicted 
with lung cancer, in which case the methods provide treatment for the disease, or patient 
considered at risk for such a disease may be treated prophylactically. 

'^^ P'^^^"t invention further provides, within other aspects, methods for 
removing tumor cells fi-om a biological sample, comprising contacting a biological 
sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
permit the removal of cells expressing the protein fi-om the sample. 
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Within related aspects, methods are provided for inhibiting the 
development of a cancer in a patient, comprising administering to a patient a biological 
sample treated as described above. 

Methods are further provided, within other aspects, for stimulating 
5 and/or expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that 
expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
10 prepared as described above ai-e also provided. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
patient an effective amount of a T cell population as described above. 

The present invention fiirther provides methods for inhibiting the 
15 development of a cancer in a patient, comprising the steps of: (a) incubating CD4"^ 
and/or CDS'*" T cells isolated fi-om a patient with one or more of: (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
20 of the proliferated T ceUs, and thereby inhibiting the development of a cancer in the 
patient. Proliferated cells may, but need not, be cloned prior to administration to the 
patient. 

Within further aspects, the present invention provides methods for 
determining the presence or absence of a cancer, preferably a lung cancer, in a patient 

25 comprising: (a) contacting a biological sample obtained from a patient with a binding 
agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount 
of polypeptide that binds to the binding agent; and (c) comparing the amoimt of 
polypeptide with a predetermined cut-off value, and therefrom determining the presence 
or absence of a cancer in the patient. Within preferred embodiments, the binding agent 

30 is an antibody, more preferably a monoclonal antibody. 
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The present invention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient Such methods comprise the steps 
of: (a) contacting a biological san^le obtained fiiom a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
5 sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained ftom the patient at a subsequent point in 
time; and (d) comparing the amount of polypeptide detected in step (c) with the amount 
detected in step (b) and therefi-om monitoring the progression of the cancer in the 
patient. 

10 The present invention further provides, within other aspects, methods for 

determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide tiiat encodes a polypeptide of the present invention; (b) 
detecting in the sample a level of a polynucleotide, preferably mRNA. that hybridizes to 
1 5 the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 
oKgonucleotide with a predetermined cut-ofif vahie, and flierefrom determining the 
presence or absence of a cancer in the patient. Within certain embodiments, the amount 
of mRNA is detected via polymerase chain reaction using, for example, at least one 
oligonucleotide primer that hybridizes to a polynucleotide encoding a polype^de as 

20 recited above, or a complement of such a polynucleotide. Within other embodiments, 
the amount of mRNA is detected using a hybridization technique, employing an 
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as 
recited above, or a complement of such a polynucleotide. 

In related aspects, methods are provided for monitoring the progression 

25 of a cancer in a patient, comprising the steps of: (a) contacting a biological sample 
obtained from a patient witii an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of the present invention; (b) detecting in the sample an amount of 
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 
using a biological sample obtained from the patient at a subsequent point in time; and 

30 (d) comparing the amount of polynucleotide detected in step (c) witii the amount 
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detected in step (b) and therefrom monitoring the progression of tlie cancer in the 
patient. 

Within further aspects, the present invention provides antibodies, such as 
monoclonal antibodies, that bind to a polypeptide as described above, as well as 
5 diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 
references disclosed herein are hereby incorporated by reference in their entirety as if 
1 0 each was incorporated individually . 

SEQUENCE IDENTIFIERS 

SEQ ID NO: 1 is the determined cDNA sequence for LST-S 1-2 
SEQ ID N0:2 is the determined cDNA sequence for LST-Sl-28 
SEQ ID N0:3 is the determined cDNA sequence for LST-S 1-90 

15 SEQ ID NO:4 is the determmed cDNA sequence for LST-Sl-144 
SEQ ID NO:5 is the determmed cDNA sequence for LST-Sl-133 
SEQ ID NO:6 is the detennined cDNA sequence for LST-Sl-169 
SEQ ID N0:7 is the determined cDNA sequence for LST-S2-6 
SEQ ID NO:8 is the determined cDNA sequence for LST-S2-1 1 

20 SEQ ID N0:9 is the determined cDNA sequence for LST-S2-17 
SEQ ID NO: 10 is the determined cDNA sequence for LST-S2-25 
SEQ ID N0:11 is the determined cDNA sequence for LST-S2-39 
SEQ ID NO: 12 is a first determined cDNA sequence for LST-S2-43 
SEQ ID NO: 13 is a second determined cDNA sequence for LST-S2-43 

25 SEQ ID NO: 14 is the determined cDNA sequence for LST-S2-65 
SEQ ID NO: 15 is the determined cDNA sequence for LST-S2-68 
SEQ ID NO: 16 is the determined cDNA sequence for LST-S2-72 
SEQ ID N0:17 is the determined cDNA sequence for LST-S2-74 
SEQ ID N0:1 8 is the determmed cDNA sequence for LST-S2-103 
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SEQ ID NO: 19 is the determined cDNA sequence for LST-S2-N1-1F 
SEQ ID NO:20 is the detennined cDNA sequence for LST-S2-N1-2A 
SEQ ID N0:21 is the detennined cDNA sequence for LST-S2-N1-4H 
SEQ ID NO:22 is the detennined cDNA sequence for LST-S2-N1-5A 
5 SEQ ID NO;23 is the determined cDNA sequence for LST-S2-N1.6B 
SEQ ID NO:24 is the detennined cDNA sequence for LST-S2-N1-7B 
SEQ ID NO:25 is the detennined cDNA sequence for LST-S2-N1-7H 
SEQ ID NO:26 is the deteimined cDNA sequence for LST-S2-N1 -8A 
SEQ ID NO:27 is the detennined cDNA sequence for LST-S2-N1-8D 
1 0 SEQ ID NO:28 is the detennined cDNA sequence for LST-S2-N1 -9A 
SEQ ID NO:29 is the detennined cDNA sequence for LST-S2-N1-9E 
SEQ ID NO:30 is the detennined cDNA sequence for LST-S2-N1-10A 
SEQ ID N0:31 is the determined cDNA sequence for LST-S2-N1-10G 
SEQ ID NO:32 is the detennined cDNA sequence for LST-S2-N1-1 1 A 
15 SEQ ID NO:33 is the detennined cDNA sequence for LST-S2-N1-12C 
SEQ ID NO:34 is the detennined cDNA sequence for LST-S2-N1-12E 
SEQ ID NO:35 is the detennined cDNA sequence for LST-S2-B1-3D 
SEQ ID NO:36 is the detennined cDNA sequence for LST-S2-B1-6C 
SEQ ID NO:37 is the detennined cDNA sequence for LST-S2-B1-5D 
20 SEQ ID NO:38 is the detennined cDNA sequence for LST-S2-B1-5F 
SEQ ID NO:39 is the detennined cDNA sequence for LST-S2-B1-6G 
SEQ ID NO:40 is the detennined cDNA sequence for LST-S2-B1-8A 
SEQ ID N0:41 is the detennined cDNA sequence for LST-S2-B1-8D 
SEQ ID NO:42 is the detennined cDNA sequence for LST-S2-B1-10A 
25 SEQ ID NO:43 is the detennined cDNA sequence for LST-S2-B1-9B 
SEQ ID NO:44 is the detennined cDNA sequence for LST-S2-B1-9F 
SEQ ID NO:45 is the detennined cDNA sequence for LST-S2-B 1-120 
SEQ ID NO:46 is the detennined cDNA sequence for LST-S2-I2-2B 
SEQ ID NO:47 is the determined cDNA sequence for LST-S2-I2-5F 
30 SEQ ID NO:48 is the detennined cDNA sequence for LST-S2-I2-6B 
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SEQIDNO:49 
SEQIDNO:50 
SEQIDN0:51 
SEQIDNO:52 
5 SEQIDNO:53 
SEQIDNO:54 
SEQIDNO:55 
SEQIDNO:56 
SEQIDNO:57 

10 SEQIDNO:58 
SEQIDNO:59 
SEQIDNO:60 
SEQ ID NO: 61 
SEQ ID NO: 62 

15 SEQ ID NO: 63 
SEQ ID NO: 64 
SEQ ID NO: 65 
SEQ ID NO: 66 
SEQ ID NO: 67 

20 SEQ ID NO: 68 
SEQ ID NO: 69 
SEQ ID NO: 70 
SEQ ID NO: 71 
SEQ ID NO: 72 

25 SEQ ID NO: 73 
SEQ ID NO: 74 
SEQ ID NO: 75 
SEQ ID NO: 76 
SEQ ID NO: 77 

30 SEQ ID NO: 78 



! the determined cDNA sequence for LST-S2-I2-7F 
5 the determined cDNA sequence for LST-S2-I2-8G 
5 the determined cDNA sequence for LST-S2-I2-9E 
J the determined cDNA sequence for LST-S2-I2-12B 
5 the determined cDNA sequence for LST-S2-H2-2C 
; the determined cDNA sequence for LST-S2-H2-1G 
i the determined cDNA sequence for LST-S2-H2-4G 
5 the determined cDNA sequence for LST-S2-H2-3H 
J the determined cDNA sequence for LST-S2-H2-5G 
5 the determined cDNA sequence for LST-S2-H2-9B 
J the determined cDNA sequence for LST-S2-H2-10H 
J the determined cDNA sequence for LST-S2-H2-12D 
s the determined cDNA sequence for LST-S3-2 
s the determined cDNA sequence for LST-S3-4 
s the determined cDNA sequence for LST-S3-7 
s the determined cDNA sequence for LST-S3-8 
s the determined cDNA sequence for LST-S3- 1 2 
s the determined cDNA sequence for LST-S3-13 
s the determined cDNA sequence for LST-S3-14 
s the determined cDNA sequence for LST-S3-16 
s the determined cDNA sequence for LST-S3-21 
s the determined cDNA sequence for LST-S3-22 
s the determined cDNA sequence for LST-Sl-7 
s the determined cDNA sequence for LST-Sl-A-IE 
s tlie determined cDNA sequence for LST-Sl-A-IG 
s the determined cDNA sequence for LST-S1-A-3E 
s the determined cDNA sequence for LST-S1-A-4E 
s the determined cDNA sequence for LST-S1-A-6D 
s the determined cDNA sequence for LST-S1-A-8D 
s the determined cDNA sequence for LST-Sl-A-lOA 
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SEQ ID NO: 79 is the determined cDNA sequence for LST-Sl-A-lOC 
SEQ ID NO: 80 is the determined cDNA sequence for LST-S1-A-9D 
SEQ ID NO: 81 is the determined cDNA sequence for LST-Sl-A-IOD 
SEQ ID NO: 82 is the determined cDNA sequence for LST-S1-A-9H 
5 SEQ ID NO: 83 is the determined cDNA sequence for LST-Sl-A-1 ID 
SEQ ID NO: 84 is the determined cDNA sequence for LST-S1-A-12D 
SEQ ID NO: 85 is (he detennined cDNA sequence for LST-Sl-A-1 IE 
SEQ ID NO: 86 is the detennined cDNA sequence for LST-S1-A-12E 
SEQ ID NO: 87 is Ihe determined cDNA sequence for L513S (T3). 

1 0 SEQ ID NO: 88 is the detennined cDNA sequence for L5 1 3S contig 1 . 
SEQ ID NO: 89 is a first determined cDNA sequence for L514S. 
SEQ ID NO: 90 is a second determined cDNA sequence for L514S. 
SEQ ID NO: 91 is a first determined cDNA sequence for L516S. 
SEQ ID NO: 92 is a second determined cDNA sequence for L5 16S. 

15 SEQ ID NO: 93 is the detennined cDNA sequence for L5 17S. 

SEQ ID NO: 94 is the extended cDNA sequence for LST-Sl-169 (also known as 
L519S). 

SEQ ID NO: 95 is a first detennined cDNA sequence for L520S. 

SEQ ID NO: 96 is a second determined cDNA sequence for L520S. 
20 SEQ ID NO: 97 is a first detennined cDNA sequence for L52 1 S. 

SEQ ID NO: 98 is a second detennined cDNA sequence for L521S. 

SEQ ID NO: 99 is the determined cDNA sequence for L522S. 

SEQ ID NO: 1 00 is the detennined cDNA sequence for L523S. 

SEQ ID NO: 101 is the determined cDNA sequence for L524S. 
25 SEQ ID NO: 1 02 is the detennined cDNA sequence for L525S. 

SEQ ID NO: 103 is the detennined cDNA sequence for L526S. 

SEQ ID NO: 1 04 is the detained cDNA sequence for L527S. 

SEQ ID NO: 1 05 is the determmed cDNA sequence for L528S. 

SEQ ID NO: 106 is the detennined cDNA sequence for L529S. 
30 SEQ ID NO: 1 07 is a fuist detennined cDNA sequence for L530S. 



11 



wo 02/00174 



PCTAJSOl/21065 



108 is a second determined cDNA sequence for L530S. 

1 09 is the determined full-length cDNA sequence for L53 1 S short fomi 

110 is the amino acid sequence encoded by SEQ ID NO: 109. 
1 1 lis the determined full-length cDNA sequence for L531S long form 

11 2 is the amino acid sequence encoded by SEQ ID NO: 111. 

1 13 is the determined full-length cDNA sequence for L520S. 

1 14 is the amino acid sequence encoded by SEQ ID NO: 1 13. 

1 1 5 is the determined cDNA sequence for contig 1 . 

1 1 6 is the determined cDNA sequence for contig 3. 

1 17 is the determined cDNA sequence for contig 4. 

1 1 8 is the determined cDNA sequence for contig 5. 

119 is the determined cDNA sequence for contig 7. 

120 is the determmed cDNA sequence for contig 8. 

121 is the determined cDNA sequence for contig 9. 

122 is the determined cDNA sequence for contig 10, 

123 is the determined cDNA sequence for contig 12. 

124 is tlie determined cDNA sequence for contig 1 1 . 

SEQ ID NO: 125 is the determined cDNA sequence for contig 13 (also known as 
L761P). 

20 SEQ ID NO: 1 26 is the determined cDNA sequence for contig 1 5. 

SEQ ID NO: 127 is the determined cDNA sequence for contig 16. 

SEQ ID NO: 128 is the determmed cDNA sequence for contig 17. 

SEQ ID NO: 129 is the determined cDNA sequence for contig 19. 

SEQ ID NO: 1 30 is the determined cDNA sequence for contig 20. 
25 SEQ ID NO: 1 3 1 is the determined cDNA sequence for contig 22. 

SEQ ID NO: 1 32 is the determined cDNA sequence for contig 24. 

SEQ ID NO: 133 is the determined cDNA sequence for contig 29. 

SEQ ID NO: 1 34 is the determined cDNA sequence for contig 3 1 . 

SEQ ID NO: 135 is the determined cDNA sequence for contig 33. 
30 SEQ ID NO: 136 is the determined cDNA sequence for contig 38. 



SEQ ID NO 
SEQ ID NO 
SEQ ID NO 
SEQ ID NO 
5 SEQ ID NO 
SEQ ID NO; 
SEQ ID NO 
SEQ ID NO 
SEQ ID NO 

10 SEQ ID NO 
SEQ ID NO 
SEQ ID NO 
SEQ ID NO 
SEQ ID NO 

15 SEQ ID NO 
SEQ ID NO 
SEQ ID NO 
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SEQ ID NO: 137 is the determined cDNA sequence for contig 39. 

SEQ ID NO: 138 is the determined cDNA sequence for contig 41. 

SEQ ID NO: 139 is the determined cDNA sequence for contig 43. 

SEQ ID NO: 140 is the determined cDNA sequence for contig 44. 
5 SEQ ID NO: 141 is the detennined cDNA sequence for contig 45. 

SEQ ID NO: 1 42 is the determined cDNA sequence for contig 47. 

SEQ ID NO: 143 is the determined cDNA sequence for contig 48. 

SEQ ID NO: 144 is the determmed cDNA sequence for contig 49. 

SEQ ID NO: 145 is the detennined cDNA sequence for contig 50. 
1 0 SEQ ID NO: 146 is the detennined cDNA sequence for contig 53. 

SEQ ID NO: 147 is the detennined cDNA sequence for contig 54. 

SEQ ID NO: 148 is the determined cDNA sequence for contig 56. 

SEQ ID NO: 149 is the determined cDNA sequence for contig 57. 

SEQ ID NO: 1 50 is the determined cDNA sequence for contig 58. 
15 SEQ ID NO: 151 is the full-length cDNA sequence for L530S. 

SEQ ID NO: 152 is the amino acid sequence encoded by SEQ ID NO: 151 

SEQ ID NO: 153 is the full-length cDNA sequence of a first variant of L514S 

SEQ ID NO: 154 is the full-length cDNA sequence of a second variant of L51 4S 

SEQ ID NO: 155 is the amino acid sequence encoded by SEQ ID NO: 153. 
20 SEQ ID NO: 1 56 is the amino acid sequence encoded by SEQ ID NO: 1 54. 

SEQ ID NO: 157 is the detennined cDNA sequence for contig 59. 

SEQ ID NO: 158 is the full-length cDNA sequence for L763P (also refen^ed to as contig 

22). 

SEQ ID NO: 159 is the amino acid sequence encoded by SEQ ID NO: 158. 
25 SEQ ID NO: 1 60 is the full-length cDNA sequence for L762P (also referred to as contig 
17). 

SEQ ID NO: 1 61 is the amino acid sequence encoded by SEQ ID NO: 160. 
SEQ ID NO: 162 is the determined cDNA sequence for L515S. 
SEQ ID NO: 1 63 is the full-length cDNA sequence of a first variant of L524S. 
30 SEQ ID NO: 1 64 is the full-length cDNA sequence of a second variant of L524S. 
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SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
5 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
56). 

SEQIDNO: 

10 SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 

15 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

20 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

25 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

30 SEQIDNO: 



165 is the amino acid sequence encoded by SEQ ID NO: 163. 

166 is the amino acid sequence encoded by SEQ ID NO: 164. 

167 is the full-length cDNA sequence of a first variant of L762P. 

168 is the full-length cDNA sequence of a second variant of L762P. 

169 is the amino acid sequence encoded by SEQ ID NO: 167. 

170 is the amino acid sequence encoded by SEQ ID NO: 168. 

171 is the full-length cDNA sequence for L773P (also referred to as contig 



172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 



IS the amino acid sequence encoded by SEQ ID NO: 171. 
s an extended cDNA sequence for L519S. 
s the amino acid sequence encoded by SEQ ID NO: 174. 
s the full-length cDNA sequence for L523 S . 
s the amino acid sequence encoded by SEQ ID NO: 175. 
s the determined cDNA sequence for LST-sub5-7A. 
s the determined cDNA sequence for LST-sub5-8G. 
s the determined cDNA sequence for LST-sub5-8H. 
s the determined cDNA sequence for LST-sub5-10B. 
is the determined cDNA sequence for LST-sub5-10H. 
s the determined cDNA sequence for LST-sub5-12B. 
s the determined cDNA sequence for LST-sub5-l IC. 
s the determined cDNA sequence for LST-sub6-lc. 
s the determined cDNA sequence for LST-sub6-2f. 
s the determined cDNA sequence for LST-sub6-2G. 
s the determined cDNA sequence for LST-sub6-4d. 
s the determined cDNA sequence for LST-sub6-4e. 
s the determined cDNA sequence for LST-sub6-4f 
s the determined cDNA sequence for LST-sub6-3h. 
IS the determined cDNA sequence for LST-sub6-5d. 
s the determined cDNA sequence for LST-sub6-5h. 
s the determined cDNA sequence for LST-sub6-6h. 
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SEQ ID NO: 194 is the determined cDNA sequence for LST-sub6-7a. 

SEQ ID NO: 195 is the determined cDNA sequence for LST-sub6-8a. 

SEQ ID NO: 196 is the determined cDNA sequence for LST-sub6-7d. 

SEQ ID NO: 197 is the determined cDNA sequence for LST-sub6-7e. 
5 SEQ ID NO: 198 is the determined cDNA sequence for LST-sub6-8e. 

SEQ ID NO: 199 is the determined cDNA sequence for LST-sub6-7g. 

SEQ ID NO: 200 is the determined cDNA sequence for LST-sub6-9f. 

SEQ ID NO: 201 is the determined cDNA sequence for LST-sub6-9h. 

SEQ ID NO: 202 is the determined cDNA sequence for LST-sub6-l lb. 
10 SEQ ID NO: 203 is the determined cDNA sequence for LST-sub6-l Ic. 

SEQ ID NO: 204 is the determined cDNA sequence for LST-sub6-12c. 

SEQ ID NO: 205 is the determined cDNA sequence for LST-sub6-12e. 

SEQ ID NO: 206 is the determined cDNA sequence for LST-sub6-12f. 

SEQ ID NO: 207 is the determined cDNA sequence for LST-sub6-l Ig. 
1 5 SEQ ID NO: 208 is the determined cDNA sequence for LST-sub6-l 2g. 

SEQ ID NO: 209 is the detennined cDNA sequwice for LST-sub6-12h. 

SEQ ID NO: 210 is the determined cDNA sequence for LST-sub6-II-la. 

SEQ ID NO: 21 1 is the determined cDNA sequence for LST-sub6-II-2b. 

SEQ ID NO: 212 is the detennined cDNA sequence for LST-sub6-II-2g. 
20 SEQ ID NO: 213 is the detennined cDNA sequence for LST-sub6-n-l h. 

SEQ ID NO: 214 is the determined cDNA sequence for LST-sub6-II-4a. 

SEQ ID NO: 21 5 is the determined cDNA sequence for LST-sub6-n-4b. 

SEQ ID NO: 216 is the determined cDNA sequence for LST-sub6-II-3e. 

SEQ ID NO: 217 is the detennined cDNA sequence for LST-sub6-II-4f. 
25 SEQ ID NO: 2 1 8 is the determined cDNA sequence for LST-sub6-U-4g. 

SEQ ID NO: 219 is the determined cDNA sequence for LST-sub6-n-4h. 

SEQ ID NO: 220 is the detennined cDNA sequence for LST-sub6-II-5c. 

SEQ ID NO: 221 is the determined cDNA sequence for LST-sub6-n-5e. 

SEQ ID NO: 222 is the detennined cDNA sequence for LST-sub6-n-6f. 
30 SEQ ID NO: 223 is the detennined cDNA sequence for LST-sub6-n-5g, 
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SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
5 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

10 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

15 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

20 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

25 SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 

30 SEQIDNO: 



224 is the determined cDNA sequence for LST-sub6-n-6g. 

225 is the amino acid sequence for L528S. 
226-25 1 are synthetic peptides derived from L762P. 



252 
253 
254 
255 
256 
257 
258 
259 
260 
261 
262 
263 
264 
265 
266 
267 
268 
269 
270 
271 
272 
273 
274 
275 
276 
277 
278 



s the expressed amino acid sequaice of L514S. 
s the DNA sequence corresponding to SEQ ID NO: 252. 
!s the DNA sequence of a L762P expression construct, 
s the determined cDNA sequence for clone 23785. 
is the determined cDNA sequence for clone 23786. 
IS the determined cDNA sequence for clone 23788. 
is the determined cDNA sequence for clone 23790. 
s the determined cDNA sequence for clone 23793. 
s the determined cDNA sequence for clone 23794. 
s the determined cDNA sequence for clone 23795. 
s the determined cDNA sequence for clone 23796. 
s the determined cDNA sequence for clone 23797. 
s the determined cDNA sequence for clone 23798. 
s the determined cDNA sequence for clone 23799. 
s the determined cDNA sequence for clone 23800. 
s the determined cDNA sequence for clone 23802. 
s the determined cDNA sequence for clone 23803. 
s the determined cDNA sequence for clone 23804. 
s the determined cDNA sequoice for clone 23805. 
s the determined cDNA sequence for clone 23806. 
s the determined cDNA sequence for clone 23807. 
s the determined cDNA sequence for clone 23808. 
s the determined cDNA sequence for clone 23809. 
s the determined cDNA sequence for clone 23810. 
s the determined cDNA sequence for clone 238 1 1 . 
s the determined cDNA sequence for clone 23812. 
s the detomined cDNA sequence for clone 23813. 
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SEQ JD NO: 279 is the determined cDNA sequence for clone 23815. 

SEQ ID NO: 280 is the detennined cDNA sequence for clone 25298. 

SEQ ID NO: 281 is the detennined cDNA sequence for clone 25299. 

SEQ ID NO: 282 is the detennined cDNA sequence for clone 25300. 
5 SEQ ID NO: 283 is the detennined cDNA sequence for clone 25301 

SEQ ID NO: 284 is the detennined cDNA sequaice for clone 25304 

SEQ ID NO: 285 is the detennined cDNA sequence for clone 25309. 

SEQ ID NO: 286 is the detennined cDNA sequaice for clone 253 12. 

SEQ ID NO: 287 is the detennined cDNA sequence for clone 25317. 
1 0 SEQ ID NO:288 is the detennined cDNA sequence for clone 2532 1 . 

SEQ ID NO:289 is the determined cDNA sequence for clone 25323, 
SEQ ID NO:290 is the determined cDNA sequence for clone 25327. 
SEQ ID NO:291 is the detennined cDNA sequaice for clone 25328. 
SEQ ID NO:292 is the detennined cDNA sequence for clone 25332. 
1 5 SEQ ID NO:293 is the detennined cDNA sequence for clone 25333. 
SEQ ID NO:294 is the detennined cDNA sequence for clone 25336. 
SEQ ID N0295 is the determined cDNA sequence for clone 25340. 
SEQ ID NO:296 is the determmed cDNA sequence for clone 25342. 
SEQ ID NO:297 is the detennined cDNA sequence for clone 25356. 
20 SEQ ID NO:298 is the detennined cDNA sequence for clone 25357. 
SEQ ID NO:299 is the determined cDNA sequence for clone 25361. 
SEQ ID NO:300 is the detennined cDNA sequence for clone 25363. 
SEQ ID NO:301 is the detennined cDNA sequence for clone 25397. 
SEQ ID NO:302 is the determined cDNA sequence for clone 25402. 
25 SEQ ID NO:303 is the determined cDNA sequence for clone 25403. 
SEQ ID NO:304 is the detennined cDNA sequence for clone 25405. 
SEQ ID NO:305 is the determined cDNA sequence for clone 25407. 
SEQ ID NO:306 is the detennined cDNA sequence for clone 25409. 
SEQ ID NO:307 is the determined cDNA sequence for clone 25396. 
30 SEQ ID NO:308 is the detennined cDNA sequence for clone 25414. 
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SEQ ID NO:309 is the determined cDNA sequence for clone 25410. 

SEQ ID N0:3 10 is the determined cDNA sequence for clone 25406. 

SEQ ID N0:3 11 is the determined cDNA sequence for clone 25306. 

SEQ BD N0:3 12 is the determined cDNA sequence for clone 25362. 
5 SEQ JD N0:3 1 3 is the determined cDNA sequence for clone 25360. 

SEQ ID NO:314 is the detamined cDNA sequence for clone 25398. 

SEQ ID NO:315 is the determined cDNA sequence for clone 25355. 

SEQ ID N0:3 1 6 is the determined cDNA sequence for clone 2535 1 . 

SEQ ID N0:3 1 7 is the determined cDNA sequence for clone 2533 1 . 
10 SEQ ID N0:3 1 8 is the determined cDNA sequence for clone 25338. 

SEQ ID NO:319 is the determined cDNA sequence for clone 25335. 

SEQ ID NO:320 is the determined cDNA sequence for clone 25329. 

SEQ ID NO:321 is the determined cDNA sequence for clone 25324. 

SEQ ID NO:322 is the detennined cDNA sequence for clone 25322. 
15 SEQ ID NO:323 is the determined cDNA sequence for clone 253 19. 

SEQ ID NO:324 is the determined cDNA sequence for clone 25316. 

SEQ ID NO:325 is the determined cDNA sequence for clone 2531 1 . 

SEQ ID NO:326 is the determined cDNA sequence for clone 25310. 

SEQ ID NO:327 is the determined cDNA sequence for clone 25302. 
20 SEQ ID NO:328 is the determined cDNA sequence for clone 25315. 

SEQ ID NO:329 is the determined cDNA sequence for clone 25308. 

SEQ ID NO:330 is the detennined cDNA sequence for clone 25303. 

SEQ ID NOs:33 1-337 are the cDNA sequences of isofoims of the p53 tumor suppressor 

homologue, p63 (also referred to as L530S). 
25 SEQ ID NOs:338-344 are the amino acid sequences encoded by SEQ ID NOs:331-337, 

respectively 

SEQ ID NO:345 is a second cDNA sequence for the antigen L763P. 

SEQ ID NO:346 is the amino acid sequence encoded by the sequence of SEQ ID NO: 

345. 

30 SEQ ID NO:347 is a determined full-length cDNA sequence for L523S. 
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SEQ ID NO;348 is the amino acid sequence encoded by SEQ ID NO: 347. 
SEQ ID NO:349 is the cDNA sequence encoding the N-terminal portion of L773P. 
SEQ ID NO:350 is the amino acid sequence of theN-tenninal portion of L773P. 
SEQ ID NO:351 is the DNA sequence for a fusion of Ral2 and the N-lerminal portion 
5 ofL763P 

SEQ ID NO:352 is the ammo acid sequence of the fusion of Ral2 and the N-tenninal 
portion of L763P 

SEQ ID NO:353 is the DNA sequence for a fusion of Ral2 and the C-terminal portion 
ofL763P 

10 SEQ ID NO:354 is the amino acid sequence of the fusion of Ral2 and the C-teimmal 
portion of L763P 
SEQIDNO:355 is a primer. 
SEQ ID NO:356 is a primer. , 

SEQ ID NO:357 is the protein sequence of expressed recombinant L762P. 
15 SEQ ID NO:358 is the DNA sequence of expressed recombinant L762P. 
SEQ ID NO:359 is a primer. 
SEQ ID NO:360 is a primer. 

SEQ ID NO:361 is the protein sequence of expressed recombinant L773P A. 

SEQ ID NO:362 is the DNA sequence of expressed recombmant L773P A. 
20 SEQ ID NO:363 is an epitope derived from clone L773P polypeptide. 

SEQ ID NO:364 is a polynucleotide encoding the polypeptide of SEQ ID NO:363. 

SEQ JD NO:365 is an epitope derived from clone L773P polypeptide. 

SEQ ID NO:366 is a polynucleotide encoding the polypeptide of SEQ ID NO:365. 

SEQ ID NO:367 is an epitope consisting of amino acids 571-590 of SEQ ID N0:161, 
25 clone L762P. 

SEQ ID NO:368 is the full-length DNA sequence for contig 13 (SEQ ID NO:125), also 
referred to as L76 IP. 

SEQ ID NO:369 is the protein sequence encoded by the DNA sequence of SEQ ID 

NO:368. 

30 SEQ ID NO:370 is an L762P DNA sequence from nucleotides 2071 -2130. 
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SEQ ID NO:371 is an L762P DNA sequence from nucleotides 1441-1500. 
SEQ ID NO:372 is an L762P DNA sequence from nucleotides 1936-1955. 
SEQ ID NO:373 is an L762P DNA sequence from nucleotides 2620-2679. 
SEQ ID NO:374 is an L762P DNA sequence from nucleotides 1801-1860. 
5 SEQ ID NO:375 is anL762P DNA sequence from nucleotides 1531-1591. 

SEQ ID NO:376 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:373. 

SEQ ID NO:377 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:370. 

10 SEQ ID NO:378 is the amino add sequence of the L762P peptide encoded by SEQ ID 
NO:372. 

SEQ ID NO:379 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:374. 

SEQ ID NO:380 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
15 NO:371. 

SEQ ID NO:381 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:375. 

SEQ ID NO:382 is the amino acid sequence of an epitope of L762P. 

SEQ ID NOs:383-386 are PGR primers. 
20 SEQ ID NOs:387-395 are ttie amino acid sequences of L773P peptides. 

SEQ ID NOs:396-419 are Ae amino acid sequences of L523S peptides. 

SEQ ID NO:420 is the detamined cDNA sequence for clone #19014. 

SEQ ID NO:421 is the forward primer PDM-278 for the L514S-13160 coding region. 

SEQ ID NO:422 is the reverse primer PDM-278 for the L514S-13 160 coding region. 
25 SEQ ID NO:423 is the amino acid sequence for the expressed recombinant L5 14S. 

SEQ ID NO:424 is the DNA coding sequence for the recombinant L514S. 

SEQ ID NO:425 is the forward primer PDM-414 for the L523S coding region. 

SEQ ID NO:426 is the reverse primer PDM-4 14 for the L523S coding re^on. 

SEQ ED NO:427 is the amino add sequence for the expressed recombinant L523S. 
30 SEQ ID NO:428 is the DNA coding sequence for the recombinant L523S. 
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SEQ ID NO:429 is the reverse primer PDM-279 for the L762PA coding region. 
SEQ ID NO:430 is the amino acid sequence for the expressed recombinant L762PA. 
SEQ ID NO:431 is the DNA coding sequence for the recombinant L762PA. 
SEQ ID NO:432 is the reverse primer PDM-300 for the L773P coding re^on. 
5 SEQ ID NO:433 is the amino acid sequence of the expressed recombinant L773P. 
SEQ ID NO:434 is the DNA coding sequence for the lecombmant L773P. 
SEQ ID NO:435 is the forward primer for TCR ValphaS. 
SEQ ID NO:436 is the reverse primer for TCR Valpha8. 
SEQ ID NO:437 is the forward primer for TCR Vbetag. 
10 SEQ ID NO:438 is the reverse primer for TCR Vbetag. 

SEQ ID NO:439 is the TCR Valpha DNA sequence of the TCR clone specific for the 
lung antigen L762P. 

SEQ ID NO:440 is the TCR Vbeta DNA sequence of the TCR clone specific for the 
lung antigen L762P. 
15 SEQ ID NO:441 is the amino acid sequence of L763 peptide #2684. 

SEQ ID NO:442 is the predicted full-length cDNA for the cloned partial sequence of 
clone L529S (SEQ ID NO: 106). 

SEQ ID NO:443 is the deduced ammo acid sequence encoded by SEQ ID NO:442 
SEQ ID NO:444 is the forward primer PDM-734 for the coding region of clone L523S. 
20 SEQ ID NO:445 is the reverse primer PDM-735 for the coding region of clone L523S. 
SEQ ID NO:446 is the amino acid sequence for the expressed recombinant L523S. 
SEQ ID NO:447 is the DNA coding sequence for the recombinant L523S. 
SEQ ID NO:448 is another forward primer PDM-733 for the coding region of clone 
L523S. 

25 SEQ ID NO:449 is the amino acid sequence for a second expressed recombinant L523S. 
SEQ ID NO:450 is the DNA coding sequence for a second recombinant L523S, 
SEQ ID NO:451 corresponds to amino acids 86-110, an epitope of L514S-specific in 
the generation of antibodies. 

SEQ ID NO:452 corresponds to amino acids 21-45, an epitope of L514S-specific in the 
30 generation of antibodies. 
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SEQ ID NO:453 corresponds to amino acids 121-135, an epitope of L514S-specific in 
the generation of antibodies. 

SEQ ID NO:454 corresponds to amino acids 440-460, an epitope of L523S-specific in 
the generation of antibodies. 
5 SEQ ID NO:455 corresponds to amino acids 156-175, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:456 corresponds to amino acids 326-345, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:457 conesponds to amino acids 40-59, an epitope of L523S-specific in the 
1 0 generation of antibodies. 

SEQ ID NO:458 corresponds to amino acids 80-99, an epitope of L523S-specific in the 
generation of antibodies. 

SEQ ID NO:459 corresponds to amino acids 160-179, an epitope of L523S-specific in 
the generation of antibodies. 
15 SEQ ID NO:460 corresponds to amino acids 180-199, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:461 corresponds to amino acids 320-339, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:462 corresponds to amino acids 340-359, an epitope of L523S-specific in 
20 the generation of antibodies. 

SEQ ID NO:463 corresponds to amino acids 370-389, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:464 corresponds to amino acids 380-399, an epitope of L523S-specific in 
the generation of antibodies. 
25 SEQ ID NO:465 corresponds to amino acids 37-55, an epitope of L523S-recognized by 
the L523S-specific CTL line 6B1. 

SEQ ID NO:466 corresponds to amino acids 41-51, the mapped antigenic epitope of 
L523S-recognized by the L523S-specific CTL line 6B1. 

SEQ ID NO:467 corresponds to the DNA sequence which encodes SEQ ID NO:466. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed generally to compositions and their use 
in the therapy and diagnosis of cancer, particularly lung cancen As described further 
5 below, illustrative compositions of the present invention include, but are not restricted 
to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding such 
polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and 
immune system cells (eg., T cells). 

The practice of the present invention will employ, unless indicated 
10 specifically to the contrary, conventional methods of vh-ology, immunology, 
microbiology, molecular biology and recombmant DNA techniques within the skill of 
the art, many of which are described below for the purpose of illustration. Such 
techniques are explained folly in the literature. See, eg., Sambrook, et al. Molecular 
Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: 
15 A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & H (D. 
Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid 
Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (1984). 
20 AH publications, patents and patent applications cited herein, whether 

supra or iiUfra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the singular fomis 
"a," "an" and "the" include plural references unless the content clearly dictates 
otherwise. 

25 Polvpeptide Compositions 

As used herein, the term "polypeptide" " is used m its conventional 
meanmg, as a sequence of amino acids. The polypeptides are not limited to a 
specific length of the product; thus, peptides, oligopeptides, and proteins are included 
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within the definition of polypeptide, and such terms may be used interchangeably herein 
unless specifically indicated otherwise. This term also does not refer to or exclude post- 
expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like, as well as other modifications known in the art, both 
5 naturally occurring and non-naturally occurring. A polypeptide may be an entire 
protein, or a subsequence thereof. Particular polypeptides of interest in the context of 
this invention are amino acid subsequences comprising epitopes, i.e., antigenic 
determinants substantially responsible for the immunogenic properties of a polypeptide 
and being capable of evoking an immune response. 

10 Particularly illustrative polypeptides of the present invention comprise 

those encoded by a polynucleotide sequence set forth in any one of SEQ ID N0s:l-3, 6- 
8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80- 
82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 

15 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 
420, 424, 428, 431, 434, 442, 447, 450 and 467, or a sequence that hybridizes under 
moderately stringent conditions, or, alternatively, under highly stringent conditions, to a 
polynucleotide sequence set forth in any one of SEQ ID N0s:l-3, 6-8, 10-13, 15-27, 29, 
30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107- 

20 109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 
160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 
217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 
431, 434, 442, 447, 450 and 467. Certain illustrative polypeptides of the invention 
comprise amino acid sequences as set forth in any one of SEQ ID NOs:152, 155, 156, 

25 165, 166, 169, 170, 172, 174, 176, 226-252, 338-344, 346, 350, 357, 361, 363, 365, 
367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449 and 451-466. 

The polypeptides of the present invention are sometimes herein referred 
to as lung timior proteins or lung tumor polypeptides, as an indication that their 
identification has been based at least in part upon their increased levels of expression in 

30 lung tumor samples. Thus, a "lung tumor polypeptide'* or "lung tumor protein," refers 
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generally to a polypeptide sequence of the present invention, or a polynucleotide 
sequence encoding such a polypeptide, that is expressed in a substantial proportion of 
lung tumor samples, for example preferably greater than about 20%, mwe preferably 
greater than about 30%, and most preferably greater than about 50% or more of lung 
5 tumor samples tested, at a level that is at least two fold, and preferably at least five fold, 
greater than the level of expression m nonnal tissues, as detemiined using a 
representative assay provided hereia A lung tumor polypeptide sequence of the 
invention, based upon its increased level of expression in tumor cells, has particular 
utility both as a diagnostic marker as well as a therapeutic target, as further described 
10 below. 

In certain preferred embodiments, the polypeptides of the invention are 
immunogenic, they react detectably within an immunoassay (such as an EUSA or 
T-cell stimulation assay) with antisera and/or T-cells from a patient with lung cancer. 
Screening for immunogenic activity can be performed using techniques well known to 
15 the skilled artisan. For example, such screens can be performed using methods such as 
those described in Harlow and Lane, Antibodies: A Laboratory Manual. Cold Spring 
Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and contacted with patient sera to allow binding of 
antibodies within the sera to the immobiUzed polypeptide. Unbound sera may then be 

20 i«moved and bound antibodies detected using, for example, ''^-labeledPix)teinA. 

As would be recognized by the skiUed artisan, immunogenic portions of 
the polypeptides disclosed herein are also encompassed by the present invention. An 
"immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide 
of the invention that itself is immunologically reactive {i.e., specifically binds) with the 

25 B^lls and/or T-cell surface antigen receptors that recognize the polypeptide. 
Lnmunogemc portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 
1993) and references cited therein. Such techniques include screening polypeptides for 
the ability to react with antigen-specific antibodies, antisera and/or T-ceU lines or 

30 clones. As used herein, antisera and antibodies are "antigen-specific" if they 

25 
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specifically bind to an antigen (z.e, they react with the protein in an ELISA or other 
immunoassay, and do not react detectably with unrelated proteins). Such antisera and 
antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred embodiment, an immunogenic portion of a polypeptide 
5 of the present invention is a portion that reacts with antisera and/or T-cells at a level that 
is not substantially less than the reactivity of the full-length polypeptide {e,g,, in an 
ELISA and/or T-cell reactivity assay). Preferably, the level of immxmogenic activity of 
the immunogenic portion is at least about 50%, preferably at least about 70% and most 
preferably greater than about of the immimogenicity for the full-length 

10 polypeptide. In some instances, preferred immunogenic portions will be identified that 
have a level of immunogenic activity greater than that of the corresponding full-length 
polypeptide, e.g., having greater than about 100% or 150% or more immimogenic 
activity. 

In certain other embodiments, illustrative immunogenic portions may 

15 include peptides in which an N-terminal leader sequence and/or transmembrane domain 
have been deleted. Other illustrative inamunogenic portions will contain a small N- 
and/or C-terminal deletion (e.g„ 1-30 amino acids, preferably 5-15 amino acids), 
relative to the mature protein. 

In another embodiment, a polypeptide composition of the invention may 

20 also comprise one or more polypeptides that are immunologically reactive with T cells 
and/or antibodies generated against a polypeptide of the invention, particularly a 
polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 
fiagment or variant thereof 

In another embodiment of the invention, polypeptides are provided that 

25 comprise one or more polypeptides that are capable of eliciting T cells and/or antibodies 
that are immunologically reactive with one or more polypeptides described herein, or 
one or more polypeptides encoded by contiguous nucleic acid sequences contained in 
the polynucleotide sequences disclosed herein, or immunogenic fragments or variants 
thereof, or to one or more nucleic acid sequences which hybridize to one or more of 

30 these sequences under conditions of modemte to high stringency. 
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The present invention, in another aspect, provides polypeptide fragments 
comprising at least about 5, 10, 15, 20, 25, 50. or 100 contiguous amino acids, «r more, 
including all intermediate lengths, of a polypeptide compositions set forth herein, such 
as those set forth in SEQ ID NOs:152. 155, 156, 165, 166, 169, 170, 172, 174, 176, 
5 226-252, 338-344, 346, 350, 357. 361, 363. 365. 367, 369. 376-382 and 387-419, 441,' 
443. 446. 449 and f 51-466, or those encoded by a polynucleotide sequence set forth in a 
sequence of SEQ ID N0s:l-3, 6-8, 10-13. 15-27. 29. 30, 32, 34-49, 51, 52, 54. 55. 57- 
59. 61-69, 71, 73, 74, 77. 78. 80-82. 84. 86-96. 107-109. Ill, 113, 125, 127, 128, 129, 
131-133, 142. 144. 148-151. 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186,' 
10 188-191. 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349,' 
358. 362. 364, 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467.' 

In another aspect, the present invention provides variants of the 
polypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%. 75o/o, 
15 80%, 85%, 90%, 910/0, 92o/„, 93%, 940/0, 95o/„, 96o/,. 97o/o. 98o/o. or 99% or more identit^ 
(determined as described below), along its length, to a polypeptide sequences set forth 



herein. 



In one preferred embodiment, the polypeptide fragments and variants 
provide by the present invention are immunologicaUy reactive with an antibody and/or 
20 T-cell that reacts with a fiill-length polypqrtide specifically set for the herein. 

In another prefeired embodiment, the polypeptide fragments and variants 
provided by the present invention exhibit a level of immunogenic activity of at least 
about 500/0. preferably at least about 70o/o, and most preferably at least about 90% or 
more of that exhibited by a full-length polypeptide sequence specifically set forth 



25 herein. 



A polypeptide 'Variant," as the term is used herein, is a polypeptide that 
typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 
occurring or may be synthetically generated, for example, by modifying one or more of 
30 the above polypeptide sequences of the invention and evaluating their immunogenic 
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activity as described herein and/or using any of a number of techniques well known in 
the art. 

For example, certain illustrative variants of the polypeptides of the 
invention include those in which one or more portions, such as an N-tennmal leader 
5 sequence or transmembrane domain, have been removed. Other illustrative variants 
include variants in which a small portion 1-30 amino acids, preferably S-IS amino 
acids) has been removed from the N- and/or C-teiminal of the mature protein. 

hi many instances, a variant will contain conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 

10 amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect tlie secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of tlie polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide 

15 with desirable characteristics, e.g., with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 
even an improved, immmiogenic variant or portion of a polypeptide of the invention, 
one skilled in the art will typically change oiie or more of the codons of the encoding 
DNA sequence according to Table 1 . 

20 For example, certain amino acids may be substituted for other amino 

acids in a protein structure without appreciable loss of interactive binding capacity with 
structwes such as, for example, antigen-binding regions of antibodies or binding sites 
on substrate molecules. Since it is the interactive capacity and nature of a protein that 
defines that protein's biological fiinctional activity, certain amino acid sequence 

25 substitutions can be made in a protein sequence, and, of course, its underlying DNA 
coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 

30 
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Table 1 



Amino Acids 








Codons 




Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 


Cysteine 


Cys 


C 


UGC 


UGU 






Aspaitic acid 


Asp 


D 


GAC 


GAU 






Glutamic acid 


GIu 


E 


GAA 


GAG 






Phenylalanme 


Phe 


F 


uuc 


UUU 






Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 


Histidine 


His 


H 


CAC 


CAU 






Isoieucine 


He 


I 


AUA 


AUC 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


Methionine 


Met 


M 


AUG 








Asparagine 


Asn 


N 


AAC 


AAU 






Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 


Glutamine 


Gin 


Q 


CAA 


CAG 






Arginine 


Aig 


R 


AGA 


AGO 


CGA 


CGC 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 


Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 


Tryptophan 


Tip 


W 


UGG 








Tyrosine 


Tyr 


Y 


UAC 


UAU 







CUG CUU 



UCG UCU 



In making such changes, the hydropathic index of amino acids may be 
5 considered. The importance of the hydropathic amino acid index in conferring 
interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other 
1 0 molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of its 

29 
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hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
5 glutamine (-3.5); aspartate (-3.5); asparagine (-3,5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other 
amino acids having a similar hydropathic index or score and still result in a protein with 
similar biolo^cal activity, ie. still obtain a biological functionally equivalent protein. 
In making such changes, the substitution of amino acids whose hydropathic indices are 

10 within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 
are even more particularly preferred. It is also understood in the art that the substitution 
of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 

15 its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101, the following hydrophilicity values 
have been assigned to amino acid residues: argmine (+3.0); lysine (+3.0); aspartate 
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 + 1); alanine (-0.5); histidine (-0.5); cysteine 

20 (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 
2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within +2 

25 is preferred, those within +1 are particularly preferred, and those within ±0.5 are even 
more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 

30 take various of the foregoing characteristics into consideration are well known to those 
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of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 

In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, tlie addition of 
5 flanking sequences at the 5* and/or 3' ends; the use of phosphorothioate or T 0-methyl 
rather thm phosphodiesterase linkages in the backbone; and/or the inclusion of 
nontraditional bases such as inosiue, queosine and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guamne, thymine and 
uridine. 

1^ Amino acid substitutions may further be made on the basis of similarity 

in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 
nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with uncharged polar head groups having similar hydrophilicity values 

15 mclude leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 
represent conservative changes include: (1) ala, pro, gly, glu, asp, ghi, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, tip, 
his. A variant may also, or altematively, contam non-conservative changes. In a 

20 preferred embodiment, variant polypeptides differ fi'om a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 
altematively) be modified by, for example,- the deletion or addition of amino acids tiiat 
have minimal influence on the inununogenicity, secondary structure and hydropathic 
nature of the polypeptide. 

25 As noted above, polypeptides may comprise a signal (or leader) sequence 

at the N-terminal end of the protein, which co-translationally or post-translationally 
directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
otiier sequence for ease of synthesis, purification or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 

30 example, a polypeptide may be conjugated to an immunoglobulin Fc region. 
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When comparing polypeptide sequences, two sequences are said to be 
"identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
5 window to identify and compare local regions of sequence similarity. A "comparison 
window*' as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

10 Optimal alignment of sequences for comparison may be conducted using 

the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 

15 In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washmgton DC Vol. 5, Suppl. 3, pp. 345-358; Hem J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and MuUer W. (1988) CABIOS ¥:1M7; Robinson, 

20 E.D. (1971) Comb. Theor 77:105; Santou, N. Nes, M. (1987) Mol Biol EvoL 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Lipman, D.J. (1983) Proc. Natl Acad, Sci. USA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 

25 conducted by the local identity algorithm of Smith and Waterman (1981) Add, APL 
Moth 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol Biol 48:443, by the search for sunilarity methods of Pearson and Lipman (1988) 
Proc. Natl Acad. Scl USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
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Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for detennining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
5 algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
and Altschul et al. (1990) ^ Mol Biol 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to detennine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is pubhcly available through the National Center for 

10 Biotechnology hiformation. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum 
achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue aUgnments; or the end of either sequence is 

15 reached. The BLAST algorithm parameters W, T and X detennine the sensitivity and 
speed of the alignment. 

hi one preferred approach, the "percentage of sequence identity" is 
detennined by comparing two optunally aligned sequences over a wmdow of 
comparison of at least 20 positions, wherem the portion of the polypeptide sequence in 

20 the comparison window may comprise additions or deletions {i.e.. gaps) of 20 percent 
or less, usuaUy 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 
which the identical amino acid residue occurs in both sequences to yield the number of 

25 matched positions, dividing the number of matched positions by the total number of 
positions in the reference sequence {i.e., the window size) and multiplying the results by 
100 to yield the percentage of sequence identity. 

Within other illustrative embodiments, a polypeptide may be a fusion 
polypeptide that comprises multiple polypeptides as described herem, or that comprises 

30 at least one polypeptide as described herein and an unrelated sequence, such as a known 
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tumor protein. A fusion partner may, for example, assist in providing T helper epitopes 
(an immunological fusion partner), preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fusion partners are both immunological 
5 and expression enhancing fusion partners. Other fusion partners may be selected so as 
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to 
desired intracellular compartments. Still further fusion partners include affinity tags, 
which facilitate purification of the polypeptide. 

Fusion polypeptides may generally be prepared using standard 

10 techniques, includmg chemical conjugation. Preferably, a fusion polypeptide is 
expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fiised polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expression vector. The 3' end of the DNA sequence encoding one 

15 polypeptide component is ligated, with or witliout a peptide linker, to the 5* end of a 
DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both component polypeptides. 

A peptide linka: sequence may be employed to separate the first and 

20 second polypeptide components by a distance sufiGcient to ensure that each polypeptide 
folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 

25 secondary structure that could interact with functional epitopes on the first and second 
polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
used in the linker sequence. Amino acid sequences which may be usefully employed as 

30 linkers mclude those disclosed in Maratea et al.. Gene ^0:39-46, 1985; Murphy et al.. 
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Proc. Natl. Acad. Sci. USA 55:8258-8262, 1986; U.S. Patent No. 4,935^3 and U.S. 
Patent No. 4,751,180. TTie Hnker sequence may generaUy be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino add regions that can be used to 
5 separate the functional domains and prevent steric interference. 

The ligated DNA sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the DNA sequaice encoding 
the first polypeptides. Similarly, stop codons required to end translation and 
10 transcription termination signals are only present 3' to tlie DNA sequence encoding the 
second polypeptide. 

The fusion polypeptide can comprise a polypeptide as described herein 
together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus. 
15 tuberculosis and hepatitis proteins {see, for example, Stoute et al. New Engl J. Med^ 
336:i6-9\, 1997). 

to one preferred embodiment, the immunological fusion partner is 
derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fragment. Ral2 compositions and methods for theur use in enhancing the expression 

20 and/or immunogenicity of heterologous polynucleotide/polypeptide sequences is 
described in U.S. Patent Application 60/158,585, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 

25 and avirulent strains of M. tuberculosis. The nucleotide sequence and amino acid 
sequence of MTB32A have been described (for example, U.S. Patent Application 
60/158,585; see also, Skeiky et al. Infection and Immun. (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the 

30 purification process. Moreover. Ral2 may enhance the immunogenicily of heterologous 
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unmimogenic polypeptides with which it is fused. One preferred Ral2 fusion 
polypeptide comprises a 14 KD C-terminal j&agment corresponding to amino acid 
residues 192 to 323 of MTB32A. 

Other preferred Ral2 polynucleotides generally comprise at least about 
5 15 consecutive nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, 
at least about 100 nucleotides, at least about 200 nucleotides, or at least about 300 
nucleotides that encode a portion of a Ral2 polypeptide. 

Ral2 polynucleotides may comprise a native sequence (te,, an 
mdogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 

10 comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
activity of the encoded fusion polypeptide is not substantially diminished, relative to a 
fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 

15 preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral2 polypeptide or a portion thereof. 

Within other preferred embodiments, an immunological fusion partner is 
derived from protem D, a surface protein of the gram-negative bacterium Haemophilus 
mfluenza B (WO 91/18926). Preferably, a protein D derivative comprises 

20 approximately the first third of the protein (e.g., the first N-terminal 100-110 amino 
acids), and a protein D derivative may be lipidated. Within certain preferred 
embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 
increase the expression level in E, coli (thus functioning as an expression enhancer). 

25 The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fiision partners include the non-structural protein from influenzae virus, NSl 
(hemaglutinm). Typically, the N-terminal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, the immunological fusion partner is the protein 

30 known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is 
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derived from Streptococcus pneumoniae, which synthesizes an N-aeetyl-L-alanine 
amidase known as amidase LYTA (encoded by the LytA gene; Gene 43-265-291, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 
backbone. The C-tenninal domain of the LYTA protein is responsible for the affinity to 
5 the choline or to some choline analogues such as DEAE. This property has been 
exploited for the development of £ coli C-LYTA expressing plasmids useful for 
expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
fi-agmait at the amino terminus has been described (see Biotechnology 70:795-798, 
1992). Within a preferred embodiment, a repeat portion of LYTA may be incoiporated 
10 into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at 
residue 178. A particularly prefeired repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fiision polypeptides, and 
the polynucleotides encoding them, wherein the fusion partner comprises a targeting 
signal capable of directing a polypeptide to the endosomaWysosomal compartment, as 
15 described in U.S. Patent No. 5,63334. An immunogenic polypeptide of the invention, 
when fused with this taigeting signal, will associate more efficiently with MHC class H 
molecules and thereby provide enhanced in vivo stimulation of CD4^ T-cells specific 
for the polypeptide. 

Polypeptides of the invention are prepared using any of a variety of well 
20 known synthetic and/or recombinant techniques, the latter of which are further 
described below. Polypeptides, portions and other variants generally less than about 
150 amino adds can be generated by synthetic means, using techniques well known to 
those of ordinary skill in the art. to one illustrative example, such polypeptides are 
synthesized using any of the commercially available solid-phase techniques, such as the 
25 MeiTifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 55:2149-2146, 1963. 
Equipment for automated synthesis of polypeptides is commercially available from 
suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City. CA). and 
may be operated according to the manufacturer's instructions. 
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In general, polypeptide compositions (including fusion polypeptides) of 
the invention are isolated. An "isolated" polypeptide is one that is removed from its 
original environment. For example, a naturally-occurring protein or polypeptide is 
isolated if it is separated from some or all of the coexisting materials in the natural 
5 system. Preferably, such polypeptides are also purified, e.g., are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
pure. 

Polynucleotide Compositions 

The present invention, in other aspects, provides polynucleotide 

10 compositions. The terms "DNA" and "polynucleotide" are used essentially 
interchangeably herein to refer to a DNA molecule that has been isolated free of total 
genomic DNA of a particular species. "Isolated," as used herein, means that a 
polynucleotide is substantially away from other coding sequences, and that the DNA 
molecule does not contain large portions of xmrelated coding DNA, such as large 

15 chromosomal fragments or other frinctional genes or polypeptide coding regions. Of 
coiirse, this refers to the DNA molecule as originally isolated, and does not exclude 
genes or coding regions later added to the segment by the hand of man. 

As will be understood by those skilled in the art, the polynucleotide 
compositions of this invention can include genomic sequences, extra-genomic and 

20 plasmid-encoded sequences and smaller engineered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may 
be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 
invention may be single-stranded (coding or antisense) or double-stranded, and may be 

25 DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules may include 
HnRNA molecules, which contain introns and correspond to a DNA molecule in a one- 
to-one manner, and mRNA molecules, which do not contain introns. Additional coding 
or non-coding sequences may, but need not, be present within a polynucleotide of the 

38 



wo 02/00174 



PCTAJSOl/21065 



10 



present invention, and a polynucleotide may, but need not, be Knked to other molecules 
and/or support materials. 

Polynucleotides may comprise a native sequence (U, an endogenous 
sequence that encodes a polypeptide/protein of the invention or a portion thereof or 
5 may comprise a sequence that encodes a variant or derivative, pi^ferably and 
immunogenic variant or derivative, of such a sequence. 

Therefore, according to another aspect of the present invention, 
polynucleotide compositions are provided that comprise some or all of a polynucleotide 
sequence set forth m any one of SEQ ID N0s:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49 
51, 52, 54. 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-loJ, 111 U3 
125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157. 158, 160. 167. 168* 
171. 179, 182, 184-186, 188-191, 193, 194, 198-207,209.210.213.214,217,220-224 
253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375. 420. 424. 428. 431, 434,' 
442, 447, 450 and 467. complements of a polynucleotide sequence set forth in any one' 
15 of SEQ ID N0s:l-3, 6-8, 10-13. 15-27. 29. 30. 32. 34-49. 51. 52. 54, 55. 57-59. 61-69. 
71, 73, 74, 77, 78. 80-82. 84, 86-96. 107-109, 111. 113. 125. 127. 128. 129, 131-133 
142, 144. 148-151. 153. 154. 157. 158. 160. 167. 168. 171, 179,. 182. 184-186! 188-19l' 
193. 194. 198-207. 209. 210. 213. 214. 217. 220-224. 253-337. 345. 347. 349. 358 362 
364. 365. 368. 370-375. 420. 424. 428, 431. 434, 442. 447. 450 and 467. and degenerate' 
20 variants of a polynucleotide sequence set forth m any one of SEQ ID N0s:l-3, 6-8, 10- 
13, 15-27, 29. 30. 32. 34-19. 51. 52. 54. 55. 57-59, 61-69, 71, 73, 74, 77, 78, 80-82 84 
86-96, 107-109. 111. 113. 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154,' 
157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191. 193, 194, 198-207, 209, 21 o' 
213, 214, 217. 220-224. 253-337, 345, 347. 349, 358, 362, 364, 365, 368, 370-375,' 42o' 
25 424, 428. 431. 434, 442, 447. 450 and 467. In certain preferred embodiment^ thi 
polynucleotide sequences set forth herein encode immunogenic polypeptides, as 
described above. 

In other related embodiments, the present mvention provides 
polynucleotide variants having substantial identity to the sequences disclosed herein in 
30 SEQ ID N0s:l-3, 6-8, 10-13, 15-27, 29. 30, 32, 34-49. 51. 52. 54. 55. 57-59, 61-69. 71. 
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73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 
144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 
194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 
365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, for example those 
5 comprising at least 70% sequence identity, preferably at least 75%, 80%, 85%, 90%, 
95%, 96%, 97%, 98%, or 99% or higher, sequence identity compared to a 
polynucleotide sequence of this invention using the methods described herein, (eg., 
BLAST analysis using standard parameters, as described below). One skilled in this art 
will recognize that these values can be appropriately adjusted to determine 
10 corresponding identity of proteins encoded by two nucleotide sequences by taking into 
account codon degeneracy, amino acid similarity, reading frame positioning and the 
like. 

Typically, polynucleotide variants wDl contain one or more substitutions, 
additions, deletions and/or insertions, preferably such that the immxmogenicity of the 

15 polypeptide encoded by the variant polynucleotide is not substantially diminished 
relative to a polypeptide encoded by a polynucleotide sequence specifically set forth 
herein). The term ''variants" should also be understood to encompasses homologous 
genes of xenogenic origin. 

In additional embodiments, the present invention provides 

20 polynucleotide fragments comprising various lengths of contiguous stretches of 
sequence identical to or complementary to one or more of the sequences disclosed 
herein. For example, polynucleotides are provided by this invention that comprise at 
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 
contiguous nucleotides of one or more of the sequences disclosed herein as well as all 

25 intermediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc; 100, 101, 102, 103, 
etc.; 150, 151, 152, 153, etc.; including all mtegers through 200-500; 500-1,000, and the 
like. 
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In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions to 
a polynucleotide sequence provided herein, or a fragment thereof, or a complementaiy 
sequence thereof. Hybridization techniques are well knovm in the art of molecular 
5 biology. For pun,oses of iUustration. suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other polynucleotides 
inchide prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); 
hybridizing at SOoC-eOoC, 5 X SSC. overnight; foUowed by washing twice at 65'>C for 
20 minutes with each of 2X. 0.5X and 0.2X SSC containing 0.1% SDS. One skilled in 
10 the art will understand fliat the stringency of hybridization can be readily manipulated, 
such as by altering the salt content of the hybridization solution and/or tiie temperature 
at which tiie hybridization is performed. For example, in another embodiment, suitable 
highly stringent hybridization conditions include tiiose described above, with the 
exception tiiat tiie temperatine of hybridization is increased, e.g., to 60-65'>C or 65- 
15 70°C. 

In certain preferred embodiments, the polynucleotides described above. 
e.g., polynucleotide variants, fragments and hybridizing sequences, encode polypeptides 
that are hmnmiologicaUy cross-reactive with a polypeptide sequence specifically set 
forth herein. In other preferred embodiments, such polynucleotides encode 
20 polypeptides that have a level of immunogenic activity of at least about 50%, preferably 
at least about 70%, and more preferably at least about 90% of that for a polypeptide 
sequence specifically set forth herem. 

The polynucleotides of the present invention, or fragments tiiereof, 
regardless of the lengtii of the coding sequence itself, may be combined with other DNA 

25 sequences, such as promoters, polyadenylation signals, additional restriction enzyme 
sites, multiple cloning sites, otiier coding segments, and tiie like, such that tiieir overall 
length may vary considerably. It is therefore, contemplated that a nucleic acid fragment 
of atoiost any length may be employed, witi, the total length preferably being limited by 
the ease of preparation and use in the intended recombinant DNA protocol. For 

30 example, illustrative polynucleotide segments with total lengtiis of about 10,000. about 
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5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 

base pairs in length, and the like, (including all intermediate lengths) are contemplated 

to be useful in many implementations of this invention. 

When comparing polynucleotide sequences, two sequences are said to be 

5 "identical'' if the sequence of nucleotides in the two sequences is the same when aligned 

for maximum correspondence, as described below. . Comparisons between two 

sequences are typically performed by comparing the sequences over a comparison 

window to identify and compare local regions of sequence similarity. A "comparison 

window" as used herein, refers to a segment of at least about 20 contiguous positions, 

« 

10 usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 

15 Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 

20 Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS -/:11-17; Robinson, 
E.D. (1971) Comb, Theor 11:105; Santou, N. Nes, M. (1987) MoL Biol Evol 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 

25 Practice of Numerical Taxonomy^ Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Lipman, D.J. (1983) Proc, Natl Acad, Scl USA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 

30 Mol Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
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Proc. Natl Acad. Sci. USA 85: 2444, by computerized implementations of Ihese 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

5 One preferred example of algorithms that arc suitable for detennining 

percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. Mol. Biol 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
10 sequence identity for the polynucleotides of the invention. Software for performing 
BLAST analyses is publicly available through the National Center for Biotechnology 
hiformation. In one illustrative example, cumulative scores can be calculated usmg, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 
1 5 the word hits in each direction are halted v^dien: the cumulative alignment score falls off 
by the quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment The BLASTNf program (for 
10 nucleotide sequences) uses as defaults a wordlength (W) of 1 1, and expectation (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoff and HenikoflF(1989) Proc. Natl. 
Acad. Set. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and 
a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 
!5 comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 
to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
0 percentage is calculated by detennining the number of positions at which the identical 
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nucleic acid bases occurs in both sequences to yield the nximber of matched positions, 
dividing the number of matched positions by the total number of positions in the 
reference sequence the window size) and multiplying the results by 100 to yield the 
percentage of sequence identity. 
5 It will be appreciated by those of ordinary skill m the art that, as a result 

of the degeneracy of the genetic code, there are many nucleotide sequences that encode 
a polypeptide as described herein. Some of these polynucleotides bear minimal 
homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
that vary due to differences in codon usage are specifically contemplated by the present 

10 invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided herein are within the scope of the present invention. Alleles are endogenous 
genes that are altered as a result of one or more mutations, such as deletions, additions 
and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not, 
have an altered structure or function. Alleles may be identified using standard 

15 techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis 
approach, such as site-specific mutagenesis, is employed for the preparation of 
immunogenic variants and/or derivatives of the polypeptides described herein. By this 
approach, specific modifications in a polypeptide sequence can be made through 

20 mutagenesis of the underlying polynucleotides that encode them. These techniques 
provides a straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 
nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the 

25 use of specific oligonucleotide sequences which encode the DNA sequence of the 
desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on 
both sides of the deletion jimction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
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change the properties of the polynucleotide itself, and/or alter the properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors 
contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
5 more properties of the encoded polypeptide, such as the immunogenicity of a 
polypeptide vaccine. The techm'ques of site-specific mutagenesis are well-known in the 
art, and are widely used to create variants of both polypeptides and polynucleotides. For 
example, site-specific mutagenesis is often used to alter a specific portion of a DNA 
molecule. In such embodimeaits, a primer comprising typically about 14 to about 25 
10 nucleotides or so in length is employed, with about 5 to about 10 residues on botii sides 
of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in botii a single 
stranded and double stranded form. Typical vectors useful in site-dkected mutagenesis 
15 include vectors such as the M13 phage. These phage are readily commercially-available 
and their use is generally well-known to those skilled in the art. Double-stranded 
plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene ofjnterest fi-om a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is 
20 performed by first obtainmg a single-sti^ded vector or melting apart of two strands of a 
double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oUgonucleotide primer bearing the desired mutated sequence is 
prepared, generally syntiietically. This primer is tiien annealed with the single-stranded 
vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I 
25 Klenow fi-agment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
tiien used to transform appropriate cells, such as £ coli ceUs, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 
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The preparation of sequence variants of the selected peptide-encoding 
DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and the DNA sequences encoding them may be 
5 obtained. For example, recombinant vectors encoding the desired peptide sequence 
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et al, 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
Maniatis et al, 1982, each incorporated herein by reference, for that purpose. 

10 As used herein, the term "oligonucleotide directed mutagenesis 

procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
signal, such as amplification. As used herein, the term ''oligonucleotide directed 

15 mutagenesis procedure" is intended to refer to a process that involves the 
template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 
rules of complementary base pairing (see, for example, Watson, 1987). Typically, 

20 vector mediated methodologies involve the introduction of the nucleic acid fi-agment 
into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of 
the amplified nucleic acid fragment. Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

In another approach for the production of polypeptide variants of the 

25 present invention, recursive sequence recombination, as described in U.S. Patent No. 
5,837,458, may be employed. In this approach, iterative cycles of recombination and 
screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immimogenic activity. 

In other embodiments of the present invention, the polynucleotide 

30 sequences provided herein can be advantageously used as probes or primers for nucleic 
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acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
a sequence region of at least about 15 nucleotide long contiguous sequence that has tiie 
same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence 
disclosed herein will find particular utility. Longer contiguous identical or 
5 complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 
(including all inteimediate lengtiis) and even up to fiill length sequences will also be of 
use in certain embodiments. 

The ability of such nucldc acid probes to specifically hybridize to a 
sequence of interest wiU enable them to be of use in detecting the presence of 
10 complementaiy sequences in a given sample. However, other uses are also envisioned, 
• such as the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
15 or so (including intemiediate lengths as well), identical or complementary to a 
polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e.g., Soufliem and Northern blotting. This would allow a gene 
product, or fi^ent tiiereof, to be analyzed, both in diverse cell types and also in 
various bacterial cells. The total size of fragment, as well as the size of the 
20 complementary stietch(es), will ultimately depend on tiie intended use or application of 
the particular nucleic acid segment Smaller fiagments will generally find use in 
hybridization embodiments, wherein the length of the contiguous complementaiy region 
may be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 
25 complonentary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 
allows the formation of a duplex molecule that is botii stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than 15 bases in 
length are generally preferred, though, in order to increase stability and selectivity of the 
30 hybrid, and thereby improve tiie quality and degree of specific hybrid molecules 
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obtained. One will generally prefer to design nucleic acid molecules having gene- 
complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

Hybridization probes may be selected from any portion of any of the 
5 sequences disclosed herein. All that is required is to review the sequences set forth 
herein, or to any continuous portion of the sequences, from about 15-25 nucleotides in 
length up to and including the full length sequence, that one wishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
factors. For example, one may wish to employ primers from towards the termini of the 
10 total sequence. 

Small polynucleotide segments or fragments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PGR™ 
15 technology of U. S. Patent 4,683,202 (incoiporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production, and by other 
recombinant DNA techniques generally known to those of skill in the art of molecular 
biology. 

The nucleotide sequences of the invention may be used for their ability to 
20 selectively form duplex molecules with complementary stretches of the entire gene or 
gene fragments of interest. Depending on the application envisioned, one will typically 
desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, 
one will typically desire to employ relatively stringent conditions to form the hybrids, 
25 e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at 
temperatures of from about 50^C to about 70°C. Such selective conditions tolerate 
little, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 
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Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heteioduplex. to these circumstances, one 
5 may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
salt, at temperatures ranging from about 20°C to about 55°C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to conlK)l 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
more stringent by the addition of increasing amounts of fonnamide, which serves to 
10 destabilize the hybrid duplex in the same mamier as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice dependmg on the desired results. 

According to another embodiment of the present invention, 
polynucleotide compositions comprising antisense oligonucleotides are provided. 
15 Antisense oUgonucleotides have been demonstrated to be effective -and targeted 
inhibitors of protein synthesis, and, consequently, provide a iher^eutic approach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
the disease. The efficacy of antisense oUgonucleotides for inhibiting protein synthesis 
is well established. For example, the synthesis of polygalactauronase and the muscarine 
20 lype 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective mRNA sequences (U. S. Patent 5.739.119 and U. S. Patent 5,759.829). 
Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, tiie multiple drug resistance gene (MDGl), ICAM-1, E-selectin, STK-1, 
stiiatal GABAa receptor and human EOF (Jaskulski et al. Science. 1988 Jun 
25 10;240(4858):1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
32; Peris et al. Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S. Patent 
5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patait 5,610.288). 
Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5.747.470; U. S. 
30 Patent 5.591,317 and U.S. Patent 5.783,683). 
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Therefore, in certain embodiments, the present invention provides 
oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 
5 or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof In a third embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 

10 and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 
Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, Tm, 
binding energy, and relative stability. Antisense compositions may be selected based 

15 upon their relative inability to form dimers, hairpins, or other secondary structures that 
vy^ould reduce or prohibit specific binding to the target mRNA in a host cell. Highly 
preferred target regions of the mRNA, are those which are at or near the AUG 
translation initiation codon, and those sequences v^hich are substantially complementary 
to 5' regions of the mRNA. These secondary structure analyses and target site selection 

20 considerations can be performed, for example, using v.4 of the OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et al. Nucleic Acids 
Res. 1997, 25(17):3389-402). 

The use of an antisense delivery method employing a short peptide 
vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 

25 hydrophobic domain derived fi*om the ftision sequence of HIV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et ah^ 
Nucleic Acids Res. 1997 Jul 15;25(14)-2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 
into cultured mammalian cells in less than 1 hour with relatively high ejfficiency (90%). 
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Further, the interaction with MPG strongly increases both the stability of the 
oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described herein are used in the design and preparation of ribozyme 
5 molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor ceUs. Ribozymes are RNA-piotein complexes that cleave 
nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonudease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Foister and Symons, Cell. 1987 Apr 24;49(2):21 1-20). For 
10 example, a large number of ribozymes accelerate phosphoester transfer reactions with 
high degree of specificity, often cleaving only one of several phosphoesters in 
oligonucleotide substrate (Cech et al. Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992May 14;357(6374):173.6). TOs specificity has been attributed to the requirement 
15 that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

Six basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds /« trans (and 
thus can cleave other RNA molecules) under physiological conditions. In genera], 
20 enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
TTius, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
25 cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over many 
30 technologies, such as antisense technology (where a nucleic acid molecule simply binds 
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to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to affect a therapeutic treatment is lower than that of an antisense 
oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many moleciiles of 
5 target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyntie. Similar mismatches in antisense molecules do not prevent their action 

10 (Woolf era/., Proc Natl Acad Sci USA. 1992 Aug 15;89(16):7305-9). Thus, the 
specificity of action of a ribozyme is greater than that bf an antisense oligonucleotide 
binding the same RNA site. 

The enzymatic nucleic acid molecule may be formed in a hammerhead, 
hairpin, a hepatitis S virus, group I intron or RNaseP RNA (in association with an RNA 

15 guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi et al Nucleic Acids Res. 1992 Sep 1 1;20(17):4559-65. Examples of 
hairpin motifs are described by Hampel et al (Eur. Pat. Appl. Publ. No. EP 0360257), 
Hampel and Tritz, Biochemistry 1989 Jun 13;28(12):4929-33; Hampel etal. Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of the 

20 hepatitis 5 vmis motif is described by Perrotta and Been, Biochemistry. 1992 Dec 
1;3 1(47): 11843-52; an example of the RNaseP motif is described by Guerrier-Takada 
etal. Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RNA ribozyme motif is 
described by Collins (Saville and Collins, Cell. 1990 May 18;61(4):685-96; Saville and 
Collins, Proc Natl Acad Sci USA. 1991 Oct l;88(19):8826-30; Collins and Olive, 

25 Biochemistry. 1993 Mar 23;32(ll):2795-9); and an example of the Group I intron is. 
described in (U. S, Patent 4,987,071). All that is important in an enzymatic nucleic acid 
molecule of this invention is that it has a specific substrate binding site which is 
complementary to one or more of the target gene RNA regions, and that it have 
nucleotide sequences within or siirrounding that substrate binding site which impart an 
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RNA Cleaving activity to the molecule. Thm the ribozyme constructs need not be 
limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. 
WO 93/23569 and Int Pat. Appl. Publ. No. WO 94/02595. each ^ifically 
5 incoiporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art wiU recognize fliat equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of tiie 
10 ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. Publ. No. WO 
92/07065;. hit Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 
91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 
15 be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efiScacy in cells, and removal of stem D bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan etal. (hit. Pat. Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 
20 administered to cells by a variety of melhods known to those familiar to the art, 
including, but not restricted to, encapsulation in Hposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegiBdable 
nanoc^es, and bioadhesive microspheres. For some indications, ribozymes may be 
directly deHvered ex vivo to cells or tissues with or without the aforementioned vehicles. 
25 Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, inlusion pmnp or stint Otiier 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions • 
30 of ribozyme delivery and administration are provided in Int. Pat Appl. Publ. No. WO 
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94/02595 and Int. Pat. AppL Publ. No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
5 vector. Transcription of the ribozyme sequences are driven from a promoter for 
eukaiyotic KNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
III (pol ni). Transcripts from pol II or pol III promoters will be expressed at high levels 
in all cells; the levels of a given pol n promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 

10 Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate ceUs. Ribozymes 
expressed from such promoters have been shown to function in mammaUan cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 

15 vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acids (PNAs) 
compositions are provided. PNA is a DNA mimic in which the nucleobases are 
attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 

20 Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number methods that 
traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
not inherent to RNA or DNA. A review of PNA including methods of making, 
characteristics of, and methods of using, is provided by Corey {Trends Biotechnol 1997 

25 Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 
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PNAs have 2-aminoethyl-glycine Unkages replacing the nonnal 
phosphodiester backbone of DNA (Nielsen et al. Science 1991 Dec 6;254(5037):1497. 
500; Hanvey et al. Science. 1992 Nov 27;258(5087):1481.5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(l):5-23). This chemist^ has three important 
5 consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides. PNAs 
are neutral molecules; secondly, PNAs are achiral. which avoids the need to develop a 
stereoselective synthesis; and thirdly, PNA synthesis uses standard Boo or Fmoc 
protocols for solid-phase peptide synthesis, although other methods, including a 
modified Merrifield method, have been used. 
10 PNA monomers or ready-made oligomers are commercially available 

fiom PerSeptive Biosystems (Fiamingham, MA). PNA syntheses by either Boc or 
Fmoc protocols are straightforward using manual or automated protocols (Norton et al, 
Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
production of chemicaUy modified PNAs or the simultaneous synthesis of families of 
1 5 closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 
depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product. In expectation of this 
20 difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be foUowed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
providing yields and purity of product similar to those observed during the synthesis of 
peptides. 

25 Modifications of PNAs for a given application may be accomplished by 

coupling amino acids during solid-phase synthesis or by attaching compounds that 
contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility, or 

30 for specific functional requirements. Once synthesized, the identity of PNAs and their 
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derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (for example, Norton et al, Bioorg Med Chem. 1995 
Apr,3(4):437-45; Petersen et al, J Pept Sci. 1995 May-Jun;l (3): 175-83; Orum et ah, 
Biotechniques. 1995 Sep;19(3):472-80; Footer et aL, Biochemistry. 1996 Aug 
5 20;35(33): 10673-9; GrifiBth et ah. Nucleic Acids Res. 1995 Aug ll;23(15):3003-8; 
Pardridge et al, Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; BofTa et al, 
Proc Natl Acad Sci USA. 1995 Mar 14;92(6):1901-5; Gambacorti-Passerini et ah. 
Blood. 1996 Aug 15;88(4):1411-7; Armitage et al, Proc Natl Acad Sci USA. 1997 
Nov ll;94(23):12320-5; Seeger et al, Biotechniques. 1997 Sep;23(3):512-7). U.S. 
10 Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

Methods of characterizing the antisense binding properties of PNAs are 
discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al 

15 (Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring the 
relative binding kinetics and stoichiometry. Similar types of measurements vs^ere made 
by Jensen et al using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 

20 apparent to the skilled artisan include use in DNA strand invasion, antismse inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor binding, genome 
cleavage, biosensors, in situ hybridization, and the like. 

Polynucleotide Identification, Characterization and Expression 
25 Polynucleotides compositions of the present invention may be identified, 

prepared and/or manipulated using any of a variety of well established techniques (see 
generally, Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
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screening a microairay of cDNAs for tumor-assodated expression (ie.. expression that 
is at least two fold greater in a tumor than in nonnal tissue, as detennined using a 
representative assay provided herein). Such screens may be performed, for example, 
using the microanay technology of Affymetrix. Inc. (Santa Clara, CA) according to the' 
5 manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. 
Acad Sci. USA P5:10614-10619, 1996 and Heller et al., Proc. Natl. Acad. Sci. USA 
94:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA 
prepared from cells expressing the proteins described herein, such as tumor cells. 

Many template dependent processes are available to amplify a target 
10 sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PGR™) which is described in detail in U.S. Patent 
Nos. 4,683,195, 4,683^02 and 4,800,159, each of which is incoqwrated herein by 
reference in its entirety. Briefly, in PGR™, two primer sequences are prepared which 
are complementary to regions on opposite complementary strands of the target 
15 sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g.. Tag polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 
and lowering the temperature of the reaction mixture, the extended primers will 
20 dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
transcription and PGR™ amplification procedure may be performed in oider to quantity 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art. 

25 Any of a number of other template dependent processes, many of which 

are variations of the PGR ™ amplification technique, are readily known and available in 
the art. Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR). described, for example, in Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent 
No. 4,883,750; Qbeta Replicase, described in PGT hitl. Pat. Appl. Publ. No 

30 PGT/US87/00880,- Strand Displacement Amplification (SDA) and Repair Ghain 
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Reaction (RCR). StiJl other amplification methods are described in Great Britain Pat. 
Appl. No. 2 202 328, and in PCX Intl. Pat. AppL Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 
(TAS) (PCX IntL Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence 
5 based amplification CNASBA) and 3SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
nucleic acid amplification process involving cyclically synthesizing single-stranded 
RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCX Intl. Pat. AppL 
Publ. No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 

10 ("ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 
1 989) are also well-known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a fiill length gene fix)m a suitable library {e.g,, a tumor cDNA library) 

15 using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5* and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

20 For hybridization techniques, a partial sequence may be labeled (e.g., by 

nick-translation or end-labeling with ^^P) using well known techniques. A bacterial or 
bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
{see Sambrook et al., Molecular Cloning: A Laboratory Manual Cold Spring Harbor 

25 Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PGR using 
a primer from the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 

30 complete sequence may then be determined using standard techniques, which may 
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involve generating a s«ies of deletion clones. IT^e resulting overlapping secpaences can 
then assembled into a single contiguous sequence. A ftll length cDMA molecule can be 
generated by ligating suitable fiagments. using v^ell knovwj techniques. 

Alternatively, amplification techniques, such as tiiose described above 
5 can be useful for obtaining a full lengfl, coding sequence a partial cDNA sequence 
One such amplification technique is inverse PCR Triglia et al., Nucl. Acids Res 
I6:Sm, 1988), ,^ch uses restriction enzymes to generate a fiagment in the known 
region of the gene. TT^ fiagment is then circularized by intr^olecular ligation and 
used as a template for PCR >vitir divergent primers derived from tiae knovm region 
10 Wrthin an alternative approach, sequences adjacent to a partial sequence may be 
retrieved by amplification with a primer to a linker sequence and a primer specific to a 
lcno^^^ regron. The ampUfied sequences are typically subjected to a second round of 
amplification witir the same linker primer and a second primer specific to tire kno^ 
region. A variation on this procedure, which employs two primers that initiate 
15 extension in opposite dir^tions fiom tiie known sequence, is described in WO 
96/38591. Anotirer such technique is known as "rapid amplification of cDNA ends" or 
RACE, nris technique involves the use of an internal primer and an external primer 
whach hybridizes to a polyA region or vector sequence, to identify sequences tiiat are 5' 
and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et 
20 .^•>^C/?il/./Ao*^^^/fe,:„M9.1991)andwalkingPCR(Park^ 

Res. 7P:3055^, 1991). Other methods employing ampfification may also be employed 
to obtain a firll length cDNA sequence. 

In certain instances, it is possible to obtain a firll lengtir cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
25 that available fiom GenBank. Searches for overlapping ESTs may gener^y be 
perfomred using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous fi.1] length sequence. Full lengUi DNA sequences 
may also be obtained by analysis of genomic fiagments. 

In other embodiments of the invention, polynucleotide sequences or 
30 fiagments thereof which encode polj^eptides of the invention, or f^on proteins or 
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functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
5 be used to clone and express a given polypeptide. 

As will be understood by those of skill in the art, it may be advantageous 
in some instances to produce polypeptidC'^ncoding nucleotide sequences possessing 
non-naturally occurring codons. For example, codons preferred by a particular 
prokaryotic or exdcaryotic host can be selected to increase the rate of protein expression 
10 or to produce a recombinant RNA transcript having desirable properties, such as a half- 
life which is longer than that of a transcript generated from the naturally occurring 
sequence. 

Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 

15 encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 
example, DNA shuflQing by random fragmentation and PCR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 

20 sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 

25 polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a conimercially available antibody. A fusion protein may also be 
engineered to contam a cleavage site located between the polypeptide-encodmg 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 
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Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Carutheis, M. H. et al. 
(1980) Nucl Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced uang diemical 
5 methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be perfonned using various solid-phase techniques 
(Roberge. J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 
achieved, for example, using the ABI 431A Peptide Synthesizer (Peridn Elmer, Palo 
Alto, CA). 

* ^ A "«wly synthesized peptide may be substantially purified by preparative 

high perfbnnance Uquid chromatography (e.g., Creighton, T. (1 983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New Yoric. N. Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
15 procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

In order to express a desired polypeptide, the nucleotide sequences 
encoding the polypeptide, or functional equivalents, may be inserted into appropriate 
20 expression vector, ie., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA 
25 techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning. A 
Laboratory Manual. Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. 
N.Y. 
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A variety of expression vector/host systems may be utilized to contain 
and express polynucleotide sequences. These include, bxit are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed vsdth yeast expression vectors; 
5 insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (eg., Ti or 
pBR322 plasmids); or am'mal cell systems. 

The "control elements" or "regulatory sequences" present in an 

10 expression vector are those non-translated regions of the vector— enhancers, promoters, 
5* and 3' untranslated regions-which interact with host ceUular proteins to carry out 
transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 

IS For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La JoUa, Calif.) or 
PSPORTl plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mannmalian cell systems, promoters from mammalian genes or from mammalian viruses 
are generally preferred. If it is necessary to generate a cell line that contains multiple 

20 copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be 
advantageously used with an appropriate selectable marker. 

In bacterial systems, any of a nimiber of expression vectors may be 
selected depending upon the use intended for the expressed polypeptide. For example, 
when large quantities are needed, for example for the induction of antibodies, vectors 

25 which direct high level expression of fusion proteins that are readily purified may be 
used. Such vectors include, but are not limited to, the multifimctional E. coli cloning 
and expression vectors such as BLUESCRIFT (Stratagene), in which the sequence 
encoding the polypeptide of interest may be ligated into the vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 

30 galactosidase so that a hybrid protein is produced; pDSf vectors (Van Heeke, G. and S. 
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M. Schuster (1989) J. Biol Che.n. 2^:5503-5509); and the like. pGEX Vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptkles as fusion 
proteins with glutathione S-transferase (GST), h general, such fusion p«,teins ax. 
soluble and can easily be purified fiom lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the pr^ence of fbe glutathione. Proteins n^ade in such 
systems may be designed to i«:lude heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 



wUl. 



In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
10 constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
Emymol 153:516-544. 

In cases where plant expression vectors are use4 the expression of 
sequences encoding polypeptides may be driven by any of a number of promoters For 

15 example, viral promoter, such as the 35S and 19S pr.,moters of CaMV may be used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N 
(1987) EMBOJ. 6:307-31 1. Altematiwly, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters may be used (Coruzzi. G. et al. (1984) EMBO J. 
3:U71.mO; Broglie. R. et al. (1984) Science 22^:838-843; and Winter, J. et al. (1991) 

20 Results Probl Cell Differ. 77:85-105). These constructs can be introduced into plant 
cells by direct DNA tr^sformation or pathogen-mediated transfection. Such techniques 
are described in a number of gener^y available reviews (see. for example, Hobbs, S. or 
Murry. L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
Hill, New York, N.Y.; pp. 191-196). 

25 An insect system may also be used to express a polypeptide of interest. 

For example, in one such system, Autographa califomica nuclear polyhedrosis virus 
(AcNPV) is used as a Vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
non-essential region of the virus, such as the polyhedrin gene, and placed under control 

30 of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
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will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of intCTest may be expressed 
(Engelhard, E. K. et al. (1994) Proc, Natl Acad. Set 91 :3224-3227). 
5 In mammalian host cells, a number of viral-based expression systems are 

generally available. For example, in cases where an adenovirus is used as an expression 
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 

10 to obtain a viable virus which is capable of expressing the polypeptide in infected host 
ceUs (Logan, J. and Shenk, T. (1984) Proc, Natl Acad, Set 57:3655-3659). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 

Specific initiation signals may also be used to achieve more efficient 

15 translation of sequences encoding a polypeptide of interest. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 

20 Aereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the im'tiation codon should be in the correct 
reading fi-ame to ensure translation of the entire insert. Exogenous translational 
elements and im'tiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 

25 appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al. {1994) Results Probl CellDiJfer. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate 
the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not Hmited to, 

30 acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
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Post-lr^Wonal pmcesstog „UcI. dcav« . "p«p™- tam of a. pottta ™, j^, 
used .0 factoe con«, i„s«ioo, folding a»i/„r ft„cti<m. Dim^ hoa c* such aa 
CHO. COS. HeLa. MDCK. HEK2M, a™l WB8. »McI, ha,. ^ «War machtac, 
a«l characWsac meeba^sma for a„cb ,oa.-.r««IaH„™, a«ivi««. ^a, be choae„ u, 
5 ««™«>tacomctmodiflca«onandpnK«aingofttef„ragapMem^ 

For Jons-tem, higb-yidd p^nteta „f ^^^j^ 
«P«asio. is ge,^y F„ ^ ^^.^^ ^ 

folynudeodd. of interest b. Wo™«i using exp,ess,„„ vectors »Ueh n«y 
coetan, vitti orig,-.s of ,epHca«o„ a„d/or endogenous exp„.sion elements and a 
sel^table marlter g»e „„ fl, „ „„ , ^^^^.^ ^ 

of the vector, cells n,ay be all„«, ,„ grow for 1-2 days in an enriched n«di, before 
^ ^ switched to selective n,edia. „f ^ ^ ^ ^ 

«s,stance to selection, and its presence allows g^Mh and r«»ve,y of cells which 
successfclly express the introduced secprences. Resis«a« clones of stably tnnrfbnned 
cells ma, be prolifetated using tissue cutane technques appropriate to the ceU type. 

Any aunrber of selection systems may be used to recover t,a«abrmed 
cell hnes. These include, bu, are not limited to. the hetpes simplex vi„,a thymidine 
hnase (Wlgier. M. et al. (1977) Ce» ,Ja23.32) and adedne phoaphoribosyltransferase 
(I^wy. I. et al. (1990) « ..:8,7-23) genes which can be employed in «.sup.. or 
20 aptsup.- cells, respecHvely. Also, amimetabolite. antibiotic or herbicide tesistance can 
be used as the basis for sdection, for example, dhfr which confa ^sistance to 
methot^ate (Wigler, M. et al. (1980) froc Nail Acai ScL 77:3567-70); „p, which 
confer, resistance to the aminoglycosides, neomycin and G-418 (Colbere^pin F et 
a, (,981) J. Mo,. BM. m,.,4); and als or pat. which confer resistanle ' to 
25 d^^orsulftto. and phosphinotrid. a^tyltransfe^se. respecdvely (M^ry. s^ra) 
Addnlonal selectable ge«s have Uen described, for ex^ple. hpB, „Wch allows cella 
to uf hze indole in place of nyptophan, or hisD, which allows cells to ntilize histinol in 

TJI'^T -^-^ 
^ «5:8047-51). The use of visible markers has gained popnlarity with such maHcers as 

30 anthocyanin^ be.a-gl„c„onidase a^ its subsh^te GUS, a«l Inciferase and ite s„bst„te 
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luciferin, being widely used not only to identify transformants, but also to quantify the 
amount of transient or stable protein expression attributable to a specific vector system 
(Rhodes, C. A. et al. (1995) Methods Mol Biol 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
S the gene of interest is also present, its presence and expression need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 
absence of marker gene fiuiction. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence under the control of a single promoter. 

10 Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 
polynucleotide sequence may be identified by a variety of procedures known to those of 
skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA- 

15 RNA hybridizations and protein bioassay or immunoassay techniques which include, 
for example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 

20 specific for the product are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioinamimoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based umnimoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 
preferred for some applications, but a competitive binding assay may also be employed. 

25 These and other assays are described, among other places, in Hampton, R. et al. (1990; 
Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E. et al. (1983; J. Exp. Med i55:121 1-1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid assays. Means 

30 for producing labeled hybridization or PGR probes for detecting sequences related to 
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polynudeotides include oUgohbdtog. mok ttamlatic^ «Wd,HnB or PCR 
• -.Plif catio. using a labded nudeotide. AltenaSve^. „ 
fl.««.f „a, b. doncd into a vector for tte p^ductioa of an nJiNA p»b.. Such vecu,,. 

art am c«un»=rciall, available, and may be u^d to synte RNA 
probes n, vitt, by additi™, of an appmpriato RNA p„lymen»e such as T7. T3 or SP6 
and labeled nucleotides. Ji^ procedures may be conducted using a vrtety „f 
con^ercially available kite. Snitoble reporter molecules or labels, wtach may be used 
molude radionuclides enzymes, fluorescen,, chemil»mnescent or chr»n„genic agen« 
as well as snbshates, co&ctors. inhibitors, magnetic panicles, and the Uke. 
10 Host cells tmnsfomted with a polynncleodde sequence of intetest may be 

oultuted under conditions suitable for the expression and mcoveo. of .he p,ote,n fi™. 
ceU cutare. ne p„tein pr«luccd by a recombinan, ceU m^ be secreted or contained 
, ..sacellularly depending on the sequence and/or the vector used. As will be understood 
by those Of skill in the sn. expression vectors contaming polynucleotides of fl,e 
mvention may be designed to contain signal sequences which direct sec«i™, of fl« 
encoded polypeptide thmugh a prdcyMie or eukao^c cell membrane. Other 
recombinant constmctions may be used to join se,„«ces encoding a polypeptide of 
mterest to nucleotide sequence encodmg a polypeptide domam which will fecilitate 
purification of soluble ptotems. Such purific«ion Mitating domains include, but are 
not Umited to, metal chelatmg peptides such as histidin..t„p.ophan modules that allow 
punficaaon on immobUized metals, protein A domains fl« allow puWflcation on 
.mmobili«d hnmunoglobulm. and flte domam utilized in the FLAGS extension/afilnity 
punfication system flnununex Cotp.. Seattle. Wash.). The inclusion of cleavable linker 
se<,«^ ^ as those ^cific for Factor XA or entetokinase Onvi^ogen. San Diego. 
25 Cdtt) between flte pmiflcation dom«n and the encoded pol^pUde may be used to 
ftcttae purification. One such expression vector pmvides for exp^ssion of a fusion 
protein containing a polypeptide „f interest and a nucleic acid encoding 6 hisddme 
residues preceding a thio^oxin or an entetokinase cleavage site. Tkc histidm. residues 
facilitete purificanon on MAC (immobilize metal ion afftu^ chmmatography) as 
30 descnbed in Po^th, , « .1. (m2. fro, E^. f^f 3:263-^ while the enterokinase 
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cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein, A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 
et al. (1 993; DNA Cell Biol J2'M 1 -453). 

In addition to recombinant production methods, polypeptides of the 
5 invention, and fiagpients thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J. (1963) Am, Chem, Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 
10 synthesized separately and combined using chemical methods to produce the full length 
molecule. 

Antibody Compositions, Fragments Thereof and Other Binding Agents 

According to another aspect, the present invention fiirther provides 
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit 

15 immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant 
or derivative thereof. An antibody, or antigen-binding fragment thereof, is said to 
"specifically bind," "immunogically bind," and/or is "immunologically reactive" to a 
polypeptide of the invention if it reacts at a detectable level (within, for example, an 
ELISA assay) with the polypeptide, and does not react detectably with umelated 

20 polypeptides under similar conditions. 

Immunological binding, as used in this context, generally refers to the 
non-covalent interactions of the type which occur between an immunoglobulin 
molecule and an antigen for which the immunoglobulin is specific. The strength, or 
affinity of immunological binding interactions can be expressed in terms of the 

25 dissociation constant (K<j) of the interaction, wherein a smaller Kd represents a greater 
affinity. Immunological binding properties of selected polypeptides can be quantified 
using methods well known in the art. One such method entails measuring the rates of 
antigen-binding sit^antigen complex formation and dissociation, wherein those rates 
depend on the concentrations of the complex partners, the affinity of the interaction, and 
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on geometric parametei. that equally influence the me in both directions. n,us both 
the "on Me constant" (KW and the "off rate constant" M can be detenninld by 
calculation of the concentrations and the actual rates of association and dissociation 

T^^^t-^ofKoff/Konenablescancellationofallparametersnotr^latedtoaffinity andis 
^us equal to the dissociation constant K. Se^ generally. Davies et al. (1990) Annua, 
Rev. Biochem. 59:439-473. 

An ■antig.-bi.rftog ^' or 'Imang porton" „f a„ ^^u^ ,„ 
the part of fte tamunogloWin moMe to, partcipetes i„ a„,ige„ 
antgen bWtag rite i. fo^ed b, ^^y^ ^^^^ 

10 »^ns of U.e heavy (■„., and ligW CV) cha,-.s. Itae MgM, div«^ 

*. V «gi<», of u,e ieavy a»d ligh, chains are refe„.d u, as TypervariaNe 
r=g.ons» which are in,e^.ed between Mote conserved flanldng stretches known as 
iramework regions,- or W. TTus the tenn "FR. „ft,s to annno add sequences 
wh.ch are natnraUy fo^d between and adjac«« to hypervariabl. regions in 
.nnnunoglobulins. In an antibody ™„,ec„,e, the three hype^iable regi™^, of . Hgh, 

charn^d the d^hypervariableregi^^ofaheavychato are disposed relati^to each 
other ,n three din^nsionl space to f„™ antigen-binding s.^. The andgen- 
bu^dmg s^faee is con,plen,en.«y to the dimensional snrfece of a bound andgen, 
and Ure three hypervariable regions „f each of fl» heavy »k. light chahis are tefe^d 
■!« as romplementarity^etennining regions,- or "CDRs." 

Binding agents n«y be ftather capable of di»,«i.ting between parents 
»* and wiflK,nt a cancer, snch as Inng cancer, using tte representative assays provided 
l-«n. For «,a.„,e. anybodies or other binding agents that bind to a tunror protein 
«n preferabfe. generate a signal indicating the presence of a cancer in a, least about 
Of paaems With dre disease, n,.^ p.ft„bly at leas, about 30% of parents 
A^tenutiveiy. or in addition, the a.tib«ly will geneme a negative signal indicating tte 
absence of the disease in at least about 90% of individuals without the cancer To 
<l«emdne whether a binding agent satisfies this ■equiren.ent, biological san„les (e. 
blood, sera, spntu™. „i„e a„a/or tun,or biopsies) Son, patients wrth and without a 
30 -"'-(■^'fctennined using standard clinical t«s) May be assayed as desoHbedhe^ 
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for the presence of polypeptides that bind to the binding agent. Preferably, a statistically 
significant number of samples with and without the disease will be assayed. Each 
binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art will recognize that binding agents may be used in combination to improve 
5 sensitivity. 

Any agent that satisfies the above requirements may be a binding agent 
For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment^ a binding agent is an 
antibody or an antigen-binding firagment thereof. Antibodies may be prepared by any of 

10 a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 
general, antibodies can be produced by cell culture techniques, including the generation 
of monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 

15 recombinant antibodies. In one technique, an inmiimogen comprising the polypeptide is 
initially injected into any of a wide variety of manunals (e.g,, mice, rats, rabbits, sheep 
or goats). In this step, the polypeptides of this invention may serve as the immunogen 
without modification. Alternatively, particularly for relatively short polypeptides, a 
superior immune response may be elicited if the polypeptide is joined to a carrier 

20 protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen 
is injected into the animal host, preferably according to a predetermined schedule 
incorporating one or more booster immunizations, and the animals are bled periodically. 
Polyclonal antibodies specific for the polypeptide may then be purified from such 
antisera by, for example, affinity chromatography using the polypeptide coupled to a 

25 suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur J, 
Immunol (5:511-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 

30 desired specificity (/.e., reactivity with the polypeptide of interest). Such cell lines may 
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be pnxiuced. for example, fiom spleen ceUs obtained fiom an animal immunized as 
described above. TT,e spleen cells a« then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
ammal. A variety offusiontedmiques may be employed. For example, the spleen cells 
5 and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma ceUs. A piefened selection technique uses HAT (hypoxanthine 
aminopteiin, thymidine) selection. After a sufBcient time, usually about I to 2 weeks' 
colonies of hybrids are observed. Single colonies are selected and their culturJ 

10 supematants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supematants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
15 vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed fiom the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affim'ty chromatography step. 
20 A nmnber of therapeuticaUy usefial molecules are known in the art which 

comprise antigen-binding sites that are capable of exhibiting immunological binding 
pmperties of an antibody molecule, lire proteolytic enzyme papain preferentially 
cleaves IgG molecules to yield several Augments, two of which (the "F(ab)" fiagments) 
each comprise a covalent heterodimer that includes an intact antigen-binding site The 
25 enzyme pepsin is able to cleave IgG molecules to provide several fragments, including 
the 'Tiab'h " fragment which comprises both antigen-binding sites. An "Fv" fragment 
can be produced by preferential proteolytic cleavage of an IgM. and on rare occasions 
IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly 
derived using recombinant techniques known in the art. lire Fv fragment includes a 
30 non-covalent Vh::V, heterodimer including an antigen-binding site which retains much 
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of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

A single chain Fv ("sFv") polypeptide is a covalently linked Vh::Vl 
5 heterodimer which is expressed &om a gene fusion including Vh- and VL-encoding 
genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
USA 85(16):5879-5883. A number of methods have been described to discern chemical 
structures for converting the naturally aggregated-but chemically separated-light and 
heavy polypeptide chains from an antibody V region into an sFv molecule which will 

10 fold into a three dimensional structure substantially similar to the structure of an 
antigen-binding site. See, e.g., U.S. Pat. Nos. 5,091,513 and 5,132,405, to Huston et al.; 
and U.S. Pat. No. 4,946,778, to Ladner et al. 

Each of the above-described molecules includes a heavy chain and a 
light chain CDR set, respectively interposed between a heavy chain and a light chain FR 

15 set which provide siq>port to the CDRS and define the spatial relationship of the CDRs 
relative to each other. As used herein, the term "CDR set" refers to the three 
hypervariable regions of a heavy or light chain V region. Proceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "CDRl," "CDR2," and 
"CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 

20 comprising the CDR set from each of a heavy and a light chain V region. A polypeptide 
comprising a single CDR, (e.g., a CDRl, CDR2 or CDRS) is referred to herein as a 
"niolecular recognition unit." Crystallographic analysis of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs form extensive 
contact with bound antigen, wherein the most extensive antigen contact is with the 

25 heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for 
the specificity of an antigen-binding site. 

As used herein, the term "FR set" refers to the four flanking amino acid 
sequences which frame the CDRs of a CDR set of a heavy or light chain V region. 
Some FR residues may contact bound antigen; however, FRs are primarily responsible 

30 for folding the V region into the antigen-binding site, particularly the FR residues 
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d.r«ly u> tte CDKS. Wiau. FRs. ce«A a,Ki <««to 

^ ^ hW, m fl^. an V .gi„„ eo„«„ a„ 

^ Oi»ladel«prf„„^90«».aoid^idu«. Wh™ a. V regions foM i„,„a 
b„rf^^ CDfe a„ display ^ec«.g ,„„p ^^j,, ^ 

.^face. I. ge^raUy fe, are co,^„ed *.cM^ .g,„„, „f 

Ffc wMch Umu^ to MdM Aapc Of CDR loop. i„,„ oe*i„ ..a^^,,,,, 
s»ucto«-regaril«a of «« cDR a™„o acid sequence. F„*e, certain FR 

"Sidnes are kn„™ to partcipate in no„-covale„, interdon^ co.««s «hicl, aaWlize 
fte mteractton of tlM antiljody Iieavy and liglit cliains. 

>» '^■'""'^'"■■^""feed- antibody n„,lecules comprising an anaga^ 

tad,ng site derived fion= a non-h»a„ tonnnogiobulin been describ«i. including 
clnmeric antibodies having roden, v ^ons and associated CDR. ftsed « hn„«, 
constant domains (Winter e, al. (1991) Natee 349:293-299; LobngUo e. al. (1989) 
P-oe. Na.. Acad. Sci. USA 86:422<M224; Shaw « al. (1987) , tan,„n„,. ,3g:4534- 
4538; and Brown e. al. (1987) Cancer Re^ 47:3577-3583), roden. CDRs grafted into a 
hu^an supporting FR prior to iW™, an ap^ate human antib«iy cons«n, 
^ma^. (R^echmann e. al. ,1988) Natine 332:323-327; VeriK,eyen e. al. ,1988) Science 
239:1534-1536; and Jbnes et .1. ,,986) Na«„e 321:522-525), and .wtan, CDRs 
sup^^ by ^mbinantiy vene^d roden. FRs (European Paten. Publication No. 
519,596, published Dec. 23, ,992). Uese T,nn«™^. ^ ,^ 

-u^ ».wan.ed in^nndogical response toward r»,en. antihun^n, antibody 
"K-lecules wtach ,inn.s to duration ^ eff.c«v«^s of taapeutic applic^ions of 
those moieties in human recipients. 

25 pp. . and -recombinanUy veneered 

25 FRs ■^««««--'ec«ve.,Ucen,en.ofFRresiduesfiom.e.,..ar„den,heavy„rligh. 
*au, V .g,o„ »iti, human PR residues in order to proWde . xenogeneic n.o,e«* 
-npn^g an antigen-binding s,te which retains substantiaUy .1, of to native FR 
polypeptide folding suucta„. Veneering techniques .,. based on to under^g «... 

toh^b„du,gcharacteristicsofana„tige.-bindmgsi.e..ede.en.h«dpriman> 
to shnch^ and relative disposition of to heavy and ligh. chain CDR s.« wititin to 
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antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochem. 59:439-473. Thus, 
antigen binding specificity can be preserved in a humanized antibody only wherein the 
CDR structures, their interaction with each other, and their interaction with the rest of 
the V region domains are carefully maintained. By using veneering techniques, exterior 
5 (e.g., solvent-accessible) FR residues which are readily encountered by the immune 
system are selectively replaced with human residues to provide a hybrid molecule that 
comprises either a weakly hnmunogenic, or substantially non-immunogenic veneered 
surface. 

The process of veneering makes use of the available sequence data for 

10 human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of 
Immimological Interest, 4th ed., (U.S. Dept. of Health and Human Semces, U.S. 
Government Printing Office, 1987), updates to the Kabat database, and other accessible 
U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be deduced from the known three-dimensional structure for 

15 human and murine antibody fragments. There are two general steps in veneering a 
murine antigen-binding site. Initially, the FRs of the variable domains of an antibody 
molecule of interest are compared with corresponding FR sequences of hiunan variable 
domains obtained from the above-identified sources. The most homologoiis himian V 
regions are then compared residue by residue to corresponding murine amino acids. The 

20 residues in the murine FR which differ from the human counterpart are replaced by the 
residues present in the human moiety using recombinant techniques well known in the 
art. Residue switching is only carried out with moieties which are at least partially 
exposed (solvent accessible), and care is exercised in the replacement of amino acid 
residues which may have a significant effect on the tertiary structure of V region 

25 domains, such as proline, glycine and charged amino acids. 

In this manner, the resultant "veneered" murine antigen-binding sites are 
thus designed to retain the murine CDR residues, the residues substantially adjacent to 
the CDRs, the residues identified as buried or mostiy buried (solvent inaccessible), the 
residues believed to participate in non-covalent (e,g., electrostatic and hydrophobic) 

30 contacts between heavy and light chain domains, and the residues from conserved 
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structural regions of the FRs which are believed to influence the "canonical" tertiary 
stn^ctures of the CDR loops. These design criteria are then used to prepare recon.bmant 
nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
munne antigen-bhiding site into human-appearing FRs that can be used to tnaisfect 
5 manunalian cells for the expression of recombinant human antibodies whid, exhibit the 
antigen specificity of the murine antibody molecule. 

hi another embodunent of the mvention, monoclonal antibodies of the 
present invention may be coupled to one or more therapeutic agents. Suitable agents in 
this regard include radionuclides, differentiation inducer., drugs, toxins, and derivatives 

10 thereof. Preferred radionuclides include '^^t '^w i86p^ mr, 2iu 

2] 2 ' ^' ^' ^> Ke, At, and 

Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 
toxms include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas 
exotoxin. Shigella toxin, and pokeweed antiviral protein. 
15 A therapeutic agent may be coupled (e.g., covalently bonded) to a 

suitable monoclonal antibody either direcUy or indirectly (e.g., via a linker group) A 
direct reaction between an agent and an antibody is possible when each possesses a 
substituent capable of reacting ^th the other. For example, a nucleophilic group, such 
as an amino or sulfliydryl group, on one may be enable of rating with a carbonyl- 
20 contaim-ng group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group {e.g., a halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an 
antrbody fiom an agent in order to avoid interference with binding capabilities. A linker 
25 group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibody, and thus increase the coupHng efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 
othmnse would not be possible. 

It will be evident to those skilled in the art that a variety of bifimctional 
30 or polyfimctional reagents, both homo- and hetero-fiinctional (such as those described in 

75 



wo 02/00174 



PCT/USOl/21065 



the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Couphng may be effected, for example, through ammo groups, carboxyl groups, 
sulfhydxyl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 
5 Where a therapeutic agent is more potent when free from the antibody 

portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker groiq) which is cleavable during or upon intemalization into a cell. A number of 
different cleavable linker groups have been described. The mechanisms for the 
intracellular release of an agent from these linker groups include cleavage by reduction 

10 of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a 
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter etal.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.). 

15 It may be desirable to couple more than one agent to an antibody. In one 

embodim^t, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 

20 coupled directly to an antibody molecule, or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 
bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 

25 such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may 
also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
radionucUde agents include radiohalogenated small molecules and chelating 
compounds. For example, U.S. Patent No. 4,735,792 discloses representative 

30 radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
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formed from chelating compounds that include those containing nitrogen and sullur 
atoms as the donor atoms for bmding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4.673.562. to Davison et al. discloses representative chelating 
compounds and their synthesis. 

^ T Cell Com positions 

The present invention, in another aspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof Such cells 
may generally be prepared in vitro or ex vivo, using standar d procedures. For example 
T cells may be isolated from bone man.,w. peripheral blood, or a fraction of bone' 
10 marrow or peripheral blood of apatient, using a commercially available cell separation 
system, such as the Isolex™ System, available from Nexell H^erapeutics. Inc. (Irvine 
CA; see also U.S. Patent No. 5.240,856; US. PatentNo. 5.215.926; WO 89/06280- WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived fi«n r^latld or 
unrelated humans, non-human mammals, cell lines or cultures. 
15 T cells may be stimulated with a polypeptide, polynucleotide encoding a 

polypeptde and/or an antigen presenting cell (APQ that expresses such a polypeptide 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T cells that are specific for the polypeptide of interest Preferably, a tumor 
polypeptide or polynucleotide of the invention is present within a delivery vehicle, such 
20 as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the present 
mvention if the T cells specificaUy proliferate, secrete cytokines or Mil target cells 
coated with the polypeptide or expressing a gene encoding the polypeptide T cell 
specificity may be evaluated using any of a variety of ^andard techniques For 
25 example, within a chromium relea^ assay or proliferation assay, a stimulation index of 
more than two fold increase i„ lysis and/or proliferation, compared to negative controls 
mdicates T cell specificity. Such assays may be performed, for example, as described in' 
Chen et al.. Cancer Res. 5^:1065-1070, 1994. Alternatively, detection of the 
proliferation of Tcells may be accomplished by a variety of known techniques. For 

77 



wo 02/00174 



PCTAJSOl/21065 



example, T cell proliferation can be detected by measuring an increased rate of DNA 
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
tumor polypeptide (100 ng/ml - 100 |ig/ml, preferably 200 ng/ml - 25 jig/ml) for 3 - 7 
5 days will typically result in at least a two fold increase in proliferation of the T cells. 
Contact as described above for 2-3 hours should result in activation of the T ceUs, as 
measured using standard cytokine assays in which a two fold increase in the level of 
cytokine release (e.g., TNF or IFN-y) is indicative of T cell activation (see Coligan et 
al.. Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T 

10 cells that have been activated in response to a tumor polypeptide, polynucleotide or 
polypeptide-expressing APC may be CD4**' and/or CDS'**. Tumor polypeptide-specific T 
cells may be expanded using standard techniques. Within preferred embodiments, the T 
cells are derived from a patient, a related donor or an unrelated donor, and are 
administered to the patient following stimulation and expansion. 

15 For therapeutic purposes, €04"*" or CDS"^ T cells that proliferate in 

response to a timior polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 

20 without the addition of T cell growth factors, such as inta:leukin-2, and/or stimulator 
cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 
cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 

25 

T Cell Receptor Compositions 

The T cell receptor (TCR) consists of 2 different, highly variable 
polypeptide chains, termed the T-cell receptor a and P chains, that are linked by a 
disulfide bond (Janeway, Travers, Walport. Immunobiology. Fourth Ed., 148-159. 
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Elsevier Science Ltd/Garland Publishing. 1999). The a/p hete«xlimer complexes vdth 
the invariant CD3 chains at the cell membrane. This complex recognizes specific 
antigenic peptides bound to MHC molecules. The enonnous diversity of TCR 
specificities is generated much hke immunoglobulin diversity, through somatic gene 
5 rean^gement. The p chain genes contain over 50 variable (V), 2 divemty (D). over 10 
joining (J) segments, and 2 constant region segments (C). The a chain genes contain 
over 70 V segments, and over 60 J segments but no D segments, as well as one C 
segment. During T ceU development in the thymus, the D to J gene i^anangement of 
the p chain occurs, followed by the V gene segment rearrangement to the DJ. IWs 
10 functional VDJp exon is transcribed and spliced to join to a Cp. For the a chain, a V„ 
gene segment rearranges to a J„ gene segment to create the functional exon that is then 
transcribed and spliced to the C„. Diversity is further i«:reased during the 
recombination process by the landom addition of P and N-nucleotides between the V 
D, and J segments of the p chain and between the V and J segments in the a chain' 
15 (Janeway. Travers. Walport. Immmobiology. Fourth Ed.. 98 and 150. Elsevier Science 
Ltd/Garland PubUshing. 1999). 

The present invention, in another aspect, provides TCRs specific for a 
polypeptide disclosed herein, or for a variant or derivative thei^of In accordance with 
the present invention, polynucleotide and amino acid sequences are provided for the V-J 
or V-D-J junctional regions or parts thereof for the alpha and beta chains of the T-cell 
raptor which recognize tumor polypeptides described herein. In gene^l, this aspect 
of the mvention relates to T..1] receptors which recognize or bind tumor polypeptides 
presented in the context of MHC. In a prefe^ed embodiment ti,e tumor antigens 
recogmzed by ti,e T-cell .^ceptors comprise a polypeptide of tiie p,^ invention For 
example, cDNA encoding a TCR specific for a _tumor peptide can be isolated from T 
cells specific for a tumor polypeptide using standard molecular biological and 
recombinant DNA techniques. 

This invention further includes the T-cell receptors or analogs thereof 
havmg substantially the same fimction or activity as fl,e T-cell receptor, of this 
30 mvention which recognize or bind ti«nor polypeptides. Suchr^eptors include, but are 
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not limited to, a fragment of the receptor, or a substitution, addition or deletion mutant 
of a T-cell receptor provided herein. This invention also encompasses polypeptides or 
peptides that are substantially homologous to the T-cell receptors provided herein or 
that retain substantially the same activity. The term "analog" includes any protein or 
5 polypeptide having an amino acid residue sequence substantially identical to the T-cell 
receptors provided herein in which one or more residues, preferably no more than 5 
residues, more preferably no more than 25 residues have been conservatively substituted 
with a functionally similar residue and which displays the functional aspects of the T- 
cell receptor as described herein. 

10 The present invention further provides for suitable mammalian host 

cells, for example, non-specific T cells, that are transfected with a polynucleotide 
encoding TCRs specific for a polypeptide described herein, thereby rendering the host 
cell specific for the polypeptide. The a and p chains of the TCR may be contained on 
separate expression vectors or alternatively, on a single expression vector that also 

IS contains an intemal ribosome entry site (IRES) for cap-independent translation of the 
gene downstream of the IRES. Said host cells expressing TCRs specific for the 
polypeptide may be used, for example, for adoptive immunotherapy of lung cancer as 
discussed further below. 

In further aspects of the present invention, cloned TCRs specific for a 
20 polypeptide recited herein may be used in a kit for the diagnosis of lung cancer. For 
example, the nucleic acid sequence or portions thereof, of tumor-specific TCRs can be 
used as probes or primers for the detection of expression of the rearranged genes 
encoding the specific TCR in a biological sample. Therefore, the present invention 
further provides for an assay for detecting messenger RNA or DNA encoding the TCR 
25 specific for a polypeptide. 

Pharmaceutical Compositions 

In additional embodiments, the present invention concerns formulation 
of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions 
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disclosed herein in phannaceutically-acceptable carriers for administration to a cell or 
an animal, either alone, or in combination with one or xook other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein 
may be administered m combination with otiier agents as well, such as, e.g., other 
5 proteins or polypeptides or various phaimaceutically-active agents. In fact, there is 
virtually no limit to other components that may also be included, given that the 
additional agents do not cause a significant adverse effect upon contact with the target 
cells or host tissues. The compositions may thus be delivered along with various other 
agents as required in the particular instance. Such compositions may be purified fix)m 
10 host cells or otiier biological sources, or alternatively may be chemically synthesized as 
described herein, yicewise, such compositions may further comprise substituted or 
derivatized RNA or DNA compositions. 

Therefore, in anotiier aspect of the present invention, pharmaceutical 
compositions are provided comprising one or more of the polynucleotide, polypeptide, 
15 antibody, and/or T-ceU compositions described herein in combination witii I 
physiologically accqrtable carrier. certain preferred embodiments, flie 

phamaceutical compositions of the invention comprise immunogenic polynucleotide 
and/or polypeptide compositions of tiie invention for use in prophylactic and tiieraputic 
vaccme appKcations. Vaccine preparation is generally described in, for example. M.F. 
20 Powell and M.J. Ne^ eds., "Vaccine Design (the subunit and adjuvant approach)," 
Plenmn Press (NY, 1995). Generally, such compositions will comprise one or more 
polynucleotide and/or polypeptide compositions of±e present invention in combination 
wifli one or more immunostimulants. 

It will be apparent tiiat any of tiie pharmaceutical compositions described 
25 herein can contain phaimaceutically acceptable salts of the polynucleotides and 
polypeptides of tfie invention. Such salts can be prepared, for example, from 
pharmaceutically acceptable non-toxic bases, including organic bases (e.g.. salts of 
primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., 
sodium, potassium, litiiium, ammonium, calcium and magnesium salts). 
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In another embodiment, illustrative immunogenic compositions, e.g., 
vaccine compositions, of the present invention comprise DNA encoding one or more of 
the polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of delivery 
S systems known to those of ordinary skill in the art. Indeed, numerous gene delivery 
techniques are well known in the art, such as those described by RoUand, Crit Rev. 
Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 
Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 

10 promoter and terminating signal). Alternatively, bacterial delivery systems may involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that expresses an 
immimogenic portion of the polypeptide on its cell surface or secretes such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding 
immunogenic polypeptides described herein are introduced into suitable mammalian 

15 host cells for expression using any of a number of known viral-based systems. In one 
illustrative embodiment, retroviruses provide a conv^ent and effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 
present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered 

20 to a subject A number of illustrative retroviral systems have been described (e.g., U.S. 
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; MiUer, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Bums 
et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet. Develop. 3:102-109. 

25 In addition, a number of illustrative adenovirus-based systems have also 

been described. Unlike retroviruses which integrate into the host genome, adenoviruses 
persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
Virol. 67:591 1-5921; Mittereder et al, (1994) Human Gene Therapy 5:717-729; Seth et 

30 al. (1994) J. Virol. 68:933-940; Barret al. (1994) Gene Therapy 1:51-58; Berkner, K, L. 
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(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 
developed for polynucleotide delivery. AAV vectors can be readily constructed using 
5 techniques well known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; 
International Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. 
(1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 
539; Muzyczka, N. (1992) Current Topics in Microbiol, and hnmunol. 158:97-129; 
10 Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 
Therapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 

Additional viral vectors use&l for delivering the polynucleotides 
encoding polypeptides of the present mvention by gene transfer include those derived 
from the pox family of viruses, such as vaccim'a virus and avian poxvirus. By way of 
15 example, vaccinia virus recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence oicoding thymidine kinase (TK). Tins vector is 
then used to transfect cells which are sunultaneously infected with vaccmia. 
20 Homologous recombination serves to insert the vaccinia promoter phjs the gene 
encoding the polypeptide of interest into the viral genome. The resultmg TK.sup.(-) 
recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

A vaccinia-based infection/transfection system can be conveniently used 
25 to provide for inducible, transient expression or coexpression of one or more 
polypeptides described herein in host cells of an organism, hi this particular system, 
cells are first infected in vitro with a vaccinia yvms recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. FoUowing infection, cells are 
30 transfected with the polynucleotide or polynucleotides of interest, drivai by a T7 
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promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high level, 
transient, cytoplasmic production of large quantities of RNA and its translation 
5 products. See, e.g., Ehoy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding sequences of interest. Recombinant avipox 
viruses, expressing immunogens from mammalian pathogens, are known to confer 

10 protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in human and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 

15 above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 

Any of a number of alphavirus vectors can also be used for delivery of 
polynucleotide compositions of the present invention, such as those vectors described in 
U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 

20 on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

Moreover, molecular conjugate vectors, such as the adenovirus chimeric 
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al. Proc. Nad. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene deliveiy 

25 under the invention. 

Additional illustrative information on these and other known viral-based 
delivery systems can be found, for example, in Fisher-Hoch et al., Proc, Natl Acad Sci 
USA 55:317-321, 1989; Flexner et al., Anft KY. Acad. Sci. 55P:86-103, 1989; Flexner 
et al.. Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; 

30 WO 89/01 973; U.S. Patent No. 4,777,127; GB 2,200,65 1 ; EP 0,345,242; 
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WO 91/02805; Berkner, Biotechniques 5:616-627. 1988; Rosenfeld et al.. Science 
252:431-424, 1991; Kolls et al., Proc. Natl. Acad Sci. USA Pi:215-219, 1994; 
Kass-Eisler et al., Proc. Natl. Acad. ScL USA P0:1 1498-1 1502, 1993; Guzman et al.. 
Circulation 5^2838-2848, 1993; and Guzman et al., Cir. Res. 75:1202-1207, 1993. 
5 In certain embodiments, a polynucleotide may be integrated into the 

genome of a target cell. This integration may be in the specific location and orientation 
via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the 
polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
10 DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to 
pemiit maintenance and rephcation independent of or in synchronization with the host 
cell cycle. The manner in which the expression construct is delivered to a ceU and 
where in the cell the polynucleotide remains is dependent on the type of expression 
construct employed. 

^ another embodiment of the mvention, a polynucleotide is 
administered/delivered as "naked" DNA, for example as described in Uhner et al., 
5fc/g«ee:2JP:1745-1749, 1993 and reviewed by Cohen, Science 25^:1691-1692, 1993. 
The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, which are ^ciently transported into the cells. 

stiU another embodiment, a composition of the present invention can 
be delivered via a particle bombardmoit approach, many of which have been described. 
M one illustrative example, gas-driven particle acceleration can be achieved with 
devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxfoid, UK) 
and Powderject Vaccines he. (Madison, WI), some examples of which are described in 
25 U.S. Patent Nos. 5,846,796; 6.010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 
799. This approach offers a needle-free delivery approach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest. 
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In a related embodiment, other devices and methods that may be usefid 
for gas-driven needle-less injection of compositions of the present invention include 
those provided by Bioject, Inc. (Portland, OR), some examples of which are described 
in U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 
5 and 5,993,412. 

According to another embodiment, the pharmaceutical compositions 
described herein will comprise one or more immunostimulants in addition to the 
inamunogenic polynucleotide, polypeptide, antibody, T-cell and/or APC compositions 
of this invention. An immunostimulant refers to essentially any substance that enhances 

10 or potentiates an inmiune response (antibody and/or cell-mediated) to an exogenous 
antigen. One preferred type of immunostimulant comprises an adjuvant. Many 
adjuvants contain a substance designed to protect the antigen jfrom rapid catabolism, 
such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such 
as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 

15 Certain adjuvants are commercially available as, for example, Freuntfs Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, . 
PA); aluminum salts such as aluminum hydroxide gel (alimi) or aluminum phosphate; 
salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated 

20 sugars; cationically or anionically derivatized polysaccharides; polyphosfphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 
GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 
adjuvants. 

Within certain embodiments of the invention, the adjuvant composition 
25 is preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl-type cytokines (e.g., IFN-y, TNFo, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the 
induction of himioral immune responses. Following application of a vaccine as 
30 provided herein, a patient will support an immune response that includes Thl - and Th2- 
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type responses. Within a piefened embodiment, in which a response is predominantly 
TTil-type, the level of Thl-type cytokines will increase to a greater extent than the level 
of Th2-type cytokines. TTie levels of these cytokines may be readily assessed using 
standard assays. For a review of the femilies of cytokines, see Mosmann and Coffinan, 
5 Ann. Rev. Immunol 7.145-173, 1989. 

Certain prefened adjuvants for eliciting a predominantly ITil-type 
response include, for example, a combination of monophosphoiyl lipid A, preferably 3- 
de-O-acylated monophosphoiyl lipid A, together with an aluminum salt. MPL® 
adjuvants are available fiom Corixa Corporation (Seattle, WA; see, for example. US 
10 Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-contaimng 
oligonucleotides (in which the CpG dinucleotide is umne%lated) also induce a 
predominantly Thl response. Such oligonucleotides are weU known and are described, 
for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008.200 and 
5,856,462. Immunostimulatoiy DNA sequences are also described, for example, by 
15 Sato et al.. Science 273:352, 1996. Another preferred adjuvant comprises a sapoiiin. 
such as Quil A, or derivatives thereof, mcluding QS21 and QS7 (AquUa 
Biopharmaceuticals Inc.. Fiamingham. MA); Escin; Digitonin; or Gypsophila or 
Chenopodium guinea saponins . Other preferred formulations include more than one 
saponin in the adjuvant combinations of the present invention, for example 
20 combinations of at least two of the following group comprising QS21, QS7, Quil A, p- 
escin, or digitonin. 

Alternatively the saponin formulations may be combined with vaccine 
vehicles composed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix. 

25 particles composed of polysaccharides or chemically modified polysaccharides] 
liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. Thi 
saponins may also be formulated in the presence of cholesterol to fomi particulate 
structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
together with a polyoxyethylene ether or ester, in either a non-particulate solution or 

30 suspension, or in a particulate structure such as a paucilamelar liposome or ISCOM. The 
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saponins may also be fonnulated with excipients such as Carbopol^ to increase 
viscosity, or may be formulated in a dry powder form with a powder excipient such as 
lactose. 

In one preferred embodiment, the adjuvant system includes the 
S combination of a monophosphoiyl lipid A and a saponin derivative, such as the 
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched with cholesterol, as described in 
WO 96/33739. Other preferred formulations comprise an oiMn-water emulsion and 
tocopherol. Another particularly preferred adjuvant formulation employing QS21, 3D- 
10 MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO 
95/17210. 

Another enhanced adjuvant system involves the combination of a CpG- 
containing oligonucleotide and a saponin derivative particularly the combination of 
CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally 

15 comprises an oil in water emulsion and tocopherol. 

Additional illustrative adjuvants for use in the pharmaceutical 
compositions of the invention include Montanide ISA 720 (Seppic, France), SAF 
(Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series 
of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 

20 Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, 
MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described 
m pending U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the 
disclosures of which are incorporated herein by reference in their entireties, and 
polyoxyethylene ether adjuvants such as those described in WO 99/52549AL 

25 Other preferred adjuvants include adjuvant molecules of the general 

formula 

(I): HO(CH2CH20)n-A-R, 
wherein, n is 1-50, A is a bond or -C(0>-, R is Ci-50 alkyl or Phenyl C1.50 alkyl. 

One embodiment of the present invention consists of a vaccine 
30 formulation comprising a polyoxyethylene ether of general formula (I), wherein n is 
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between 1 and 50, preferably 4-24, most preferably 9; the J? component is C^o, 
prefembly C4-C20 alkyl and most preferably Cn alkyl, and ^ is a bond. The' 
concentration of the polyoxyelhylene ethers should be in the range O.I.200/0, preferably 
fiom 0.1-100/0, and most preferably in the range 0.1-1%. Preferred polyoxyeAylene 
5 ethers are selected fiom the foUowing group: polyoxyethylene-9-lauryl ether. 
polyoxyethylene-9-steory] ether, polyoxyethylene-8-steoiyl ether, polyoxyethylene-4- 
laiuyl ether, polyoxyethyIene-35-lamyl ether, and polyoxyethylene-23-lauiyl ether. 
Polyoxyethylene ethers such as polyoxyethylene lamyl ether are described in the Merck 
index (12* edition: ent^r 7717). These adjuvant molecules are described in WO 
10 99/52549. 

The polyoxyethylene ether according to the general fomiula (I) above 
may. if desired, be combined with another adjuvant. For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 
GB 9820956.2. 



15 



According to anotiier embodiment of this invention, an immunogenic 
composition described herein is delivered to a host via antigen presenting cells (APCs). 
such as dendritic cells, macrophages. B cells, monocytes and otiier ceUs tiiat may be' 
engineered to be efficient APCs. Such cells may. but need not, be genetically modified 
to increase the capacity for presenting tiie antigen, to impix,ve activation and/or 
20 maintenance of tire T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with tiie receiver (ie.. matched HLA haplotype). APCs 
may generally be isolated fi-om any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 
or xenogeneic cells. 

25 Certain preferred embodiments of the present invention use dendritic 

cells or progenitors tiiereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steimnan. Nature iP2:245-251, 1998) and have been shown to 
be effective as a physiological adjuvant for eliciting prophylactic or ti^empeutic 
antitumor immmiity (see Timmerman and Levy, Ann Rev. Med 50:507-529, 1 999). In 

30 general, dendritic cells may be identified based on their typical shape (stellate in situ, 
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with marked cytoplasmic processes (dendrites) visible in vit)'o\ their ability to take up, 
process and present antigens with high efficiency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 
surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
5 vivo, and such modified dendritic cells are contemplated by the present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine (see Zitvogel et al., Nature Med ^:594-600, 
1998). 

Dendritic cells and progenitors may be obtained fi'om peripheral blood, 

10 bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph 
nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For 
example, dendritic cells may be differentiated ex vivo by adding a combination of 
cytokines such as GM-CSF, IL-4, IL-13 and/or TNFa to cultures of monocytes 
harvested fi-om peripheral blood. Alternatively, CD34 positive cells harvested fi-om 

15 peripheral blood, umbilical cord blood or bone marrow may be differentiated into 
dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFa, 
CD40 ligand, LPS, flt3 ligand and/or otiier compound(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" 

20 cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 
correlates with the high expression of Fey receptor and mannose receptor. The mature 

25 phenotype is typically characterized by a lower expression of these markers, but a high 
expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhesion molecules {e.g„ CD54 and GDI 1) and costimulatory molecules 
(e.g„ CD40, CD80, CD86 and 4-lBB). 

APCs may generally be transfected with a polynucleotide of the 

30 invention (or portion or other variant thereof such that the encoded polypeptide, or an 
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immunogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex vivo, and a pharmaceutical composition comprising such transfected ceUs 
may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
deliv«y vehicle that targets a dendritic or other antigen presenting cell may be 
5 administered to a patient, resulting in transfection that occurs /« vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be perfomied using any 
methods knovvn in the art. such as tiiose described in WO 97/24447, or the gene gun 
approach described by Mahvi et al.. Immunology and cell Biology 75:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic ceUs or 
10 progenitor cells Avith the tumor polypeptide, DNA (naked or witiiin a plasmid vector) or 
RNA; or with antigen-expressing recombinant bacterium or viruses (e.g.. vaccinia, 
fowlpox. adenovirus or lentivirus vectors). Prior to loading, the polypq,tide may be 
covalently conjugated to an immunological partner that provides T cell help (e.g., a 
carrier molecule). Alternatively, a dendritic ceU may be pulsed witii a non-conjugaied 
15 immunological partner, separately or in the presence of the polypeptide. 

WMle any suitable carrier known to those of ordinary skill in the art may 
be employed in the pharmaceutical compositions of this invention, the type of carrier 
will typicaUy vary depending on the mode of administration. Compositions of the 
present invention may be formulated for any appropriate mamier of administiBtion, 
20 including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 
intraperitoneal, subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. I„ certain embodiments, the 
formulation preferably provides a relatively constant level of active component release. 
25 In other embodiments, however, a more rapid rate of release immediately upon 
administration may be desired. The formulation of such compositions is weU within the 
level of ordinary skill in the art using known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide-co-glycolide). polyacrylate, latex, 
starch, cellulose, dextran and the like. Other illustrative delayed-release caniers 
30 include supramolecular biovectors. which comprise a non-liquid hydrophilic core ie.g.. 
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a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 
comprising an amphiphilic compound, such as a phospholipid {see e.g., U.S. Patent No. 
5,151^54 and PCX applications WO 94/20078, WO/94/23701 and WO 96/06638). The 
amount of active compound contained vvithin a sustained release formulation depends 
S upon the site of implantation, the rate and expected diiration of release and the nature of 
the condition to be treated or prevented. 

In another illustrative embodiment, biodegradable microspheres {e.g., 
polylactate polyglycolate) are employed as carriers for the compositions of this 
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 

10 Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
5,814,344, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and references cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 
comprising particulate-protein complexes, such as those described in U.S. Patent No. 

15 5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 
responses in a host 

The pharmaceutical compositions of the invention will often fijrther 
comprise one or more buffers (e.g., neutral buffered saline or phosphate buffered 
saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 

20 polypeptides or amino acids such as glycine, antioxidants, bacteriostals, chelating 
agents such as EDTA or glutathione, adjuvants (e.g., alxmiinum hydroxide), solutes that 
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a 
recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

25 The pharmaceutical compositions described herein may be presented in 

xmit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
are typically sealed in such a way to preserve the sterility and stability of the 
formulation until use. la general, formulations may be stored as suspensions, solutions 
or emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
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may be stored in a fieeze^lried condition requiring only the addition of a sterile liquid 
carrier immediately prior to xise. 

The development of suitable dosing and treatment regimens for usmg the 
particular compositions described herein in a variety of treatment regimens, including 
5 e.g.. oral, parenteral, intravenous, intranasal, and intramuscular administration and 
formulation, is well known in the art, some of which are briefly discussed below for 
gaieral purposes of illustration. 

hi certain applications, the pharmaceutical compositions disclosed herein 
may be delivered via oral administration to an animal. As such, these compositions 
10 may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-sheU gelatin capsule, or they may be compressed into 
tablets, or they may be incorporated directly with the food of the diet. 

The active compounds may even be incorporated with exdpients and 
used in the form of mgestible tablets, buccal tables, troches, capsules, elixirs, 
15 suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et al. Nature 
1997 Mar 27;386(6623):410-J; Hwang et al., Crit Rev Ther Drug Carrier Syst 
1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
5,792,451). Tablets, troches, piUs, capsules and the like may also contam any of a 
variety of additional components, for example, a binder, such as gum tragacanth, acacia, 
20 cornstarch, or gelatin; excipients, such as dicalcium phosphate; a dismtegrating agent, 
such as com starch, potato starch, alginic acid and the like; a lubricant, such as 
magnesium stearate; and a sweetenmg agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
25 materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit. For instance, 
tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form should be pharmaceutically pure and 
substantially non-toxic in the amounts employed. In addition, the active compounds 
30 may be incorporated mto sustained-release preparation and formulations. 
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Typically, these formulations vnll contain at least about 0.1% of the 
active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveni^tly be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Natxirally, the amount of 
5 active compoxmd(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any given unit dose of the compoimd. 
Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 

10 variety of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form of a mouthwash, 
dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Alternatively, the active ingredient may be incorporated into an oral solution such as 

15 one containing sodiimi borate, glycerin and potassiimi bicarbonate, or dispersed in a 
dentifrice, or added in a therapeutically-effective amoimt to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 
Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 

20 In certain circumstances it will be desirable to deliver the phannaceutical 

compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally. Such approaches are well known to the skilled artisan, some of which 
are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 

25 as free base or pharmacologically acceptable salts may be prepared in water suitably 
mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 
Under ordinary conditions of storage and use, these preparations generally will contain a 
preservative to prevent the growth of microorganisms. 



94 



wo 02/00174 



PCT/DSOl/21065 



Illustrative pharmaceutical forms suitable for injectable use include 
sterile aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 
5,466,468). In all cases the form must be sterile and must be fluid to the extent that 
5 easy syringabHity exists. It must be stable under the conditions of manufecture and 
storage and must be preserved against the contaminating action of microorganisms, 
such as bacteria and fungi. ITie carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (e.g.. glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 

10 oils. Proper fluidity may be maintained, for example, by the use of a coating, such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
by the use of surfactants. TTie prevention of the action of microorganisms can be 
facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 

15 preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearate 
and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, 
20 the solution should be suitably buffered if necessary and the Hquid diluent first rendered 
isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous medium that can be employed will 
be knovm to those of skUI in the art in light of the present disclosure. For example, one 
25 dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human administration, preparations wUl of 



95 



wo 02/00174 



PCT/USOl/21065 

I 



course preferably meet sterility, pyrogenicity, and the general safety aiid purity 
standards as required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed . 
herein may be formulated in a neutral or salt form. Illustrative 
S pharmaceutically-acceptable salts include the acid addition salts (formed v^th the free 
amino groups of the protein) and which are formed with inorganic adds such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived fiom inorganic bases such as, for example, sodium, potassium, ammonium, 

10 calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
administered in a manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, 

15 vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, buffers, carrier solutions, susprasions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 

20 active ingredients can also be incorporated into the compositions. The phrase 
"pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar xmtoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. 

25 Methods for delivering genes, nucleic acids, and peptide compositions directly to the 
lungs via nasal aerosol sprays has been described, e.g,, in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal microparticle 
resins (Takenaga et al, J Controlled Release 1998 Mar 2;52(l-2):81-7) and 
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) are also well-known m 
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thephaimaceutical arts. Likewise, illustrative transmucosa] drag delivery in the fonn of 
a polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

Ill certain embodiments, liposomes, nanocapsules, microparticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
5 present invention into suitable hosi cells/organisms. In particular, the compositions of 
the present invention may be formulated for delivery either encapsulated in a lipid 
particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, 
compositions of the present invention can be bound, eitiier covalently or non-covalently, 
to the surface of such carrier vehicles. 

, '^^ formation and use of liposome and liposome-like preparations as 

potential drug carriers is generally known to tiiose of skill in the art (see for example, 
•Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 
Mar;56(3):691-5; Chandran et al, Indian J Exp Biol. 1997 Aug;35(8):801-9; MaigaHt, 
Crit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 
15 Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 
5,795,587, each specifically incorporated herein by reference in its entirety). 

Uposomes have been used successfully witii a number of cell types that 
are normally difficult to transfect by other procedures, including T ceU suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et al. J Biol Chem. 1990 Sep' 

20 25;265(27):16337^2; MuUer et al, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of tiie DNA length constraints that are typical of viral-based deliver^ 
systems. Liposomes have been used effectively to introduce genes, various drugs, 
radiolherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
flie like, into a variety of cultured cell lines and animals. Furthermore, he use of 

25 liposomes does not appear to be associated witii autoimmune responses or unacceptable 
toxicity after systemic delivery. ' 

In certain embodiments, liposomes are formed fiom phosphoKpids tiiat 
are dispersed in an aqueous medium and spontaneously fonn multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 
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Alternatively, in other embodiments, the invention provides for 
pharmaceutically-acceptable nanocapsule formulations of the compositions of the 
present invention. Nanocapsules can generally entrap compounds in a stable and 
reproducible way (see, for example, Quintanar-Guerrero et aL^ Drug Dev Ind Pharm. 
5 199& Dec;24(l 2): 1113-28). To avoid side effects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 (xm) may be designed using 
polymers able to be degraded in vivo. Such particles can be made as described, for 
example, by Couvreur etaL, Crit Rev Ther Drug Cairier Syst. 1988;5(l):l-20; zur 
Muhlen et al, Eur J Pharm Biophann. 1998 Mar;45(2): 149-55; Zambaux et al J 
10 Controlled Release. 1998 Jan 2;50(l-3):3 1-40; and U. S. Patent 5,145,684. 

Cancer Therapeutic Methods 

Immunologic approaches to cancer therapy are based on the recognition 
that cancer cells can often evade the body's defenses against aberrant or foreign cells 
and molecules, and that these defenses might be therapeutically stimulated to regain the 
15 lost ground, e.g. pgs. 623-648 in Klein, Immunology (Wiley-Interscience, New York, 
1982). Numerous recent observations that various immune effectors can directly or 
indirectly inhibit growth of tumors has led to renewed interest in this approach to cancer 
therapy, e.g. Jager, et al.. Oncology 2001;60(l):l-7; Renner, et al., Ann Hematol 2000 
Dec;79(12):651-9. 

20 Four-basic cell types whose function has been associated with antitumor 

cell immimity and the elimination of tumor cells from the body are: i) B-lymphocytes 
which secrete immunoglobulins into the blood plasma for identifying and labeling the 
nonself invader cells; ii) monocytes which secrete the complement proteins that are 
responsible for lysing and processing the immunoglobulin-coated target invader cells; 

25 iii) natural killer lymphocytes having two mechanisms for the destruction of tumor 
cells, antibody-dependent cellular cytotoxicity and natural killing; and iv) T- 
lymphocytes possessing antigen-specific receptors and having the capacity to recognize 
a tumor cell carrying complementary marker molecules (Schreiber, H,, 1989, in 
Fundamental Immunology (ed). W. E. Paul, pp. 923-955). 
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Cancer immunotherapy generally focuses on inducing humoral immune 
responses, cellular immune responses, or bolh. Moreover, it is well established that 
induction of CD4* T helper cells is necessary in order to secondarily induce either 
antibodies or cytotoxic CD8* T ceUs. Polypeptide antigens that are selective or ideaUy 
5 specific for cancer cells, particularly lung cancer cells, offer a powerfid approach for 
inducing immune responses against lung cancer, and are an important aspect of tiie 
present invoition. 

Therefore, in further aspects of the present invention, the pharmaceutical 
compositions described herein may be used for the treatment of cancer, particularly for 
10 tiie immunotherapy of lung cancer. Within such metiiods, the phannaceutical 
compositions described herein are administered to a patient, typically a warm-blooded 
ammal, preferably a human. A patient may or may not be afflicted witii cancer. 
Accordingly, the above pharmaceutical compositions may be used to prevent the 
development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical 
15 compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treabnent such as administration of radiotherapy or 
conventional chemotherapeutic drugs. As discussed above, administration of the 
phannaceutical compositions may be by any suitable method, including administration 
by intravenous, intiraperitoneal, intiamuscular, subcutaneous, intranasal, intradermal, 
20 anal, vaginal, topical and oral routes. 

Within certain embodiments, immunotherapy may be active 
immunother^y. in which treatinent relies on the in vivo stimulation of the endogenous 
host immune system to react against tumors with the administration of immune 
response-moditying agents (such as polypeptides and polynucleotides as provided 
25 herein). 

Witiiin other embodiments, immunotherapy may be passive 
immunotherapy, in which ti-eatment involves the delivery of agents with established 
tumor-immune reactivity (such as effector cells or antibodies) tiiat can directiy or 
indirectiy mediate antitumor effects and does not necessarily depend on an intact host 
30 immune system. Examples of effector cells include T cells as discussed above, T 
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lymphocytes (such as CD8^ cytotoxic T lymphocytes and CD4^ T-helper tumor- 
infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 
activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and 
macrophages) expressmg a polypeptide provided herein. T cell receptors and antibody 
5 receptors specific for the polypeptides recited herein may be cloned, expressed and 
transferred into other vectors or effector cells for adoptive immunother^y. The 
polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 
immunotherapy. 

10 Monoclonal antibodies may be labeled with any of a variety of labels for 

desired selective usages in detection, diagnostic assays or therapeutic applications (as 
described in U.S. Patent Nos. 6,090,365; 6,015,542; 5,843,398; 5,595,721; and 
4,708,930, hereby incorporated by reference in their entirety as if each was incorporated 
individually). In each case, the binding of the labelled monoclonal antibody to the 

15 determinant site of the antigen will signal detection or delivery of a particular 
therq)eutic agent to the antigenic determinant on the non-normal cell. A further object 
of this invention is to provide the specific monoclonal antibody suitably labelled for 
achieving such desired selective usages thereof. 

Effector cells may generally be obtained in sufficient quantities for 

20 adoptive immunotherapy by growth in vitro^ as described herein. Culture conditions for 
expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 
conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as IL-2) and non-dividing feeder cells. As noted above, 

25 immunoreactive polypeptides as provided herein may be used to rapidly expand 
antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 

30 in the art. For example, antigen-presenting cells can be transfected with a 



100 



wo 02/00174 



PCT/USOl/21065 



polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long tenn in vivo. Studies 
have shoxvn that cultured effector cells can be induced to grow in vivo and to survive 
5 long term in substantial numbers by repeated stimulation with antigen supplemented 
with IL.2 (see, for example. Cheever et al.. Immunological Reviews 157:111, \ 997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting ceUs taken from a patient and clonally pix,pagated ex 
vivo for transplant back into the same patient. Transfected cells may be reintroduced 
10 mto the patient using any means known in tiie art, preferably in sterile fonn by 
intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes and fequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, and may be 
readUy established using standard techniques. In general, the pharmaceutical 
15 compositions and vaccines may be administered by injection (e.g., intmcutaneous. 
intramuscular, inti-avenous or subcutaneous), intranasally ie.g., by aspiration) or oiBlly 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 
20 individual patients. A suitable dose is an amomit of a compound that, when 
administered as described above, is capable of promoting an anti-tumor immune 
response, and is at least 10-50% above the basal (/.e.. untreated) level. Such response 
can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine- 
dependent generation of cytolytic effector cells capable of killing die patient's tumor 
25 ceIls«vi>o. SuchVaccinesshouldalsobecapableofcausinganimmuneresponsethat 
leads to an improved clinical outcome {e.g., more frequent remissions, complete or 
partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients. In general, for pharmaceutical compositions and vaccines 
comprising one or more polypeptides, tiie amount of each polypeptide present in a dose 
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ranges from about 25 to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0.1 mL to about 5 mL. 

In general, an appropriate dosage and treatment regimen provides the 
active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic 
5 benefit. Such a response can be monitored by establishing an improved clinical 
outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
survival) in treated patients as compared to non-treated patients. Increases in 
preexisting immune responses to a tumor protein generally correlate with an improved 
clinical outcome. Such immune responses may generally be evaluated using standard 
10 proliferation, cytotoxicity or cytokine assays, which may be performed using samples 
obtained from a patient before and after treatment. 

Cancer Detection and Diagnostic Compositions, Methods and Kits 

In general, a cancer may be detected in a patient based on the presence of 
one or more lung tumor proteins and/or polynucleotides encoding such proteins in a 

15 biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) 
obtained from the patient. In other words, such proteins may be used as markers to 
indicate the presence or absence of a cancer such as lung cancer. In addition, such 
proteins may be useful for the detection of other cancers. The binding agents provided 
herem generally permit detection of the level of antigen that binds to the agent in the 

20 biological sample. 

Polynucleotide primers and probes may be used to detect the level of 
mKNA encoding a tumor protein, which is also indicative of the presence or absence of 
a cancer. In general, a tumor sequence should be present at a level that is at least two- 
fold, preferably three-fold, and more preferably five-fold or higher in tumor tissue than 

25 in normal tissue of the same type from which the tumor arose. Expression levels of a 
particular timior sequence in tissue types different fix)m that in which the tumor arose 
are irrelevant in certain diagnostic embodiments since the presence of tumor cells can 
be confirmed by observation of predetermiaed differential expression levels, e.g., 2- 
fold, 5-fold, etc, in tumor tissue to expression levels in normal tissue of the same type. 
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Other differential expression patterns can be utilized advantageously for 
diagnostic purposes. For example, in one aspect of the invention, ovei^xpression of a 
tumor sequence in tumor tissue and nonnal tissue of the same type, but not in other 
normal tissue types. e.g. PBMCs. can be exploited diagnosticaUy. In this case, the 
5 presence of metastatic tumor cells, for example in a sample taken from the circulation 
or some other tissue site different fiom that in vWch the tumor arose, can be identified 
and/or confirmed by detecting expression of the tumor sequence in the sample, for 
example using RT-PCR analysis. In many instances, it .vill be desired to enrich for 
tumor cells in the sample of interest, e.g., PBMCs, using cell capture or other like 
10 techniques. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory,' 
1 988. In general, the presence or absence of a cancer in a patient may be determined by 
15 (a) contacting a biological sample obtained from a patient with a binding agent; (b) 
detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 
comparing the level of polypeptide with a predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of binding agent 
immobilized on a solid support to bind to and remove the polypeptide from the 
20 remainder of the sample. Hie bound polypeptide may then be detected using a detection 
reagent that contains a rqxjrter group and specifically binds to the binding 
agent/polypq,tide complex. Such detection reagents may comprise, for example a 
binding agent that specificaUy binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G 
25 protein A or a lectia Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 
binding agent after incubation of the binding agent with the sample. extent to 
^•ch components of the sample inhrbit the binding of the labeled polypeptide to the 
binding agent is indicative of the reactivity of the sample witii the immobilized binding 
30 agent. Suitable polypeptides for use within such assays include fWl length lung tumor 
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proteins and polypeptide portions thereof to which the binding agent binds, as described 
above. 

The solid support may be any material known to those of ordinary skill 
in the art to which the tumor protem may be attached. For example, the solid support 
5 may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 

10 using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature, hi the context of the present invention, 
the term "immobilization" refers to both noncovalent association, such as adsorption, 
and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 

15 Immobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in 
a suitable buffer, with the solid support for a suitable amount of tune. The contact time 
varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or 

20 polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 
10 Jig, and preferably about 100 ng to about 1 jig, is sufficient to immobilize an 
adequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifimctional reagent that will react with 

25 both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
having an appropriate polymer coating using benzoquinone or by condensation of an 
aldehyde group on the support with an amine and an active hydrogen on the binding 
partner {see, Pierce Immunotechnology Catalog and Handbook, 1991, at 

30 A12-A13). 
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In certain embodiments, the assay is a two-antibody sandwich assay 
This assay may be perfomed by first contacting an antibody that has been immobihzed 
on a solid support, commonly the well of a microliter plate, with the sample, such that 
polypeptides within the sample are aUowed to bind to the immobiKzed antibody 
5 Unbound sample is then n»noved from the immobilized poljpeptide-anfbody 
complexes and adetection reagent (preferably a second antibody capable of binding to a 
different site on the polypeptide) containing a reporter group is added. Tl,e amount of 
detection reagent that remains bound to the solid ^pport is then determined using a 
method appropriate for the specific reporter group. 
10 More specifically, once the antibody is immobilized on the support as 

described above, the remaimng protein binding sites on the support ar^ typically 
blocked. Any suitable blocking agent known to those of ordinaiyskiU in theart such as 
bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis. MO) The 
immobilized antibody is then incubated with the sample, and polypeptide is allowed to 
15 bind to the antibody. Tl,e s^ple may be dihited with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
tm.e (,.e, incubation time) is a period of time that is sufiBcient to detect the presence of 
polypeptide within a sample obtained from an individual with lung cancer. Preferably 
the contact time is sufBcient to achieve a level of binding that is at least about 95% of 
20 that achieved at equilibrium between bound and unbound polypeptide. ITiose of 
ordinary skill in the art wiU r«:ognize that the time necessary to achieve equilibrium 
may be readily determined by assaying the level of binding that occurs over a period of 
tmie. At room tempemture, an incubation time of about 30 minutes is generally 
sufiBcient. 

25 Unbound sample may then be removed by washing the solid support 

vnih an appropriate buffer, such as PBS containing O.P/o Tween 20™. n,e second 
antibody, which contains a reporter group, may then be added to the solid support 
Preferred reporter groups include those groups recited above. 

The detection reagent is then incubated with the immobilized antibody- 

30 polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 
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An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound detection reagent is then removed 
and bound detection reagent is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
5 radioactive groups, scintillation counting or autc. biographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin, coupled to a diflFerent 
reporter group (commonly a radioactive or fluorescent group or an en2yme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
10 specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

To determine the presence or absence of a cancer, such as lung cancer, 
the signal detected from the reporter group that remains bound to the solid support is 
generally compared to a signal that corresponds to a predetermined cut-off value. In 

15 one preferred embodiment, the cut-off value for the detection of a cancer is the average 
mean signal obtained when the immobilized antibody is incubated with samples from 
patients without the cancer. In general, a sample generating a signal that is three 
standard deviations above the predetermined cut-off value is considered positive for the 
cancer. In an alternate preferred embodiment, the cut-oflf value is determined using a 

20 Receiver Operator Curve, according to the method of Sackett et al.. Clinical 
Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, 
p. 106-7. Briefly, in this embodiment, the cut-off value may be determined firom a plot 
of pairs of true positive rates {i,e,, sensitivity) and false positive rates (100%-specificity) 
that correspond to each possible cut-off value for the diagnostic test result. The cut-off 

25 value on the plot that is the closest to the upper left-hand comer (i.e., the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
signal that is higher than the cut-off value determined by this method may be considered 
positive. Alternatively, the cut-off value may be shifted to the left along the plot, to 
minimize the false positive rate, or to the right, to minimize the false negative rate. In 
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general, a sample generating a signal that is higher than the cut-offvalue detennined by 
this mefliod is considered positive for a cancer. 

In a related embodiment, the assay is performed in a flow-tbiough or 
strip test foxmat. wherein the binding agent is immobilized on a membmne, such as 
5 nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second 
labeled binding agent then binds to the binding agent-polypeptide complex as a solution' 
containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 
10 format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
region contaimng second binding agent and to the area of immobilized bmding agent. 
Concentration of second binding agent at the area of hmnobilized antibody indicates the 
presence of a cancer. Typically, the concentration of second binding agent at that site 
15 generates a pattern, such as a Ime, that can be read visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattern when the 
biological sample contams a level of polypeptide that would be sufficient to generate a 
positive signal in the two-antibody sandwich assay, in the format discussed above 
20 Preferred bmding agents for use in such assays are antibodies and antigen-binding 
fragments thereof. Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about Ijig, and more preferably from about 50 ng to about 
500 ng. Such tests can typically be performed with a very small amount of biological 
sample. 

Of course, numerous other assay protocols exist that are suitable for use 
vdth the tumor protems or binding agents of the present invention. The above 
descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may be readily modified to use 
tumor polypeptides to detect antibodies that bind to such polypeptides in a biological 
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sample. The detection of such tumor protein specific antibodies may correlate with the 
presence of a cancer. 

A cancer may also, or alternatively, be detected based on the presence of 
T cells tliat specifically react with a tumor protein in a biological sample. Within 
5 certain methods, a biological sample comprising CD4^ and/or CD8* T cells isolated 
fix>m a patient is incubated with a tumor polypeptide, a polynucleotide «icoding such a 
polypeptide and/or an APC that expresses at least an immunogenic portion of such a 
polypeptide, and the presence or absence of specific activation of the T cells is detected. 
Suitable biological samples include, but are not limited to, isolated T cells. For 

10 example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T 
cells may be incubated in vitro for 2-9 days (typically 4 days) at 37^C with polypeptide 
(eg., 5-25 jig/ufil). It may be desirable to incubate another aliquot of a T cell sample in 
the absence of tumor polypeptide to serve as a control. For CD4'*^ T cells, activation is 

15 preferably detected by evaluating proliferation of the T cells. For CDS"*^ T cells, 
activation is preferably detected by evaluating cytolytic activity. A level of proliferation 
that is at least two fold greater and/or a level of cytolytic activity that is at least 20% 
greater than in disease-free patients indicates the presence of a cancer in the patient. 

As noted above, a cancer may also, or alternatively, be detected based on 

20 the level of mRNA encoding a tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 
(PGR) based assay to ampliiy a portion of a tumor cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for (/.e., 
hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is 

25 then separated and detected using techniques well known in tlie art, such as gel 
electrophoresis. 

Similarly, oligonucleotide probes that specifically hybridize to a 
polynucleotide encoding a tumor protein may be used in a hybridization assay to detect 
the presence of polynucleotide encoding the tumor protein in a biological sample. 
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To permit hybridization under assay conditions, oUgonucleotide primers 
and probes should comprise an oUgonucleotide sequence that has at least about 60% 
preferably at least about 75% and more preferably at least about 90-/o, identity to a 
portion of a polynucleotide encoding a tumor protein of the invention that is at least 10 
5 nucleotides, and preferably at least 20 nucleotides, in length. Preferably 
oligonucleotide primers and/or pr.,bes hybridize to a polynucleotide encoding a 
polypeptide described herein under moderately stringent conditions, as defined above. 
Oligonucleotide primers and/or probes which may be usefully employed in the 
diagnostic methods described herein preferably are at least 10-40 nucleotides in length 

10 In a ptefen-ed embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 
having a sequence as disclosed herein. Techniques for both PGR based assays and 
hybridization assays are well known in the art {see, for example, Mullis et al.. Cold 
Spring Harbor Symp. Quant Biol, 5J:263, 1987; Erlich ed., PCR Technology, St^ton 

15 Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCai is applied in 
conjunction with reverse transcription. Typically, RNA is retracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PGR amplification using at least one specific primer generates a cDNA molecule, which 

20 may be separated and visualized usmg, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken from a test patient and fiom an 
individual who is not afflicted with a cancer. The amplification reaction may be 
performed on several dilutions of cDNA spamiing two orders of magnitude. A two-fold 
or greater increase in expiession in several dilutions of the test patient sample as 

25 compared to the same dilutions of the non-cancerous sample is typically considered 
positive. 

In another aspect of the present invention, cell capture technologies may 
be used in conjunction, with, for example, real-time PGR to provide a more sensitive 
tool for detection of metastatic cells expressing lung tumor antigens. Detection of lung 
30 cancer cells in biological samples, eg., bone marrow samples, peripheral blood, and 
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small needle aspiration samples is desirable for diagnosis and prognosis in lung cancer 
patients. 

Immunomagnetic beads coated with specific monoclonal antibodies to 
surface cell markers, or tetrameric antibody complexes, may be used to first enrich or 
5 positively select cancer cells in a sample. Various commercially available kits may be 
used, including Dynabeads® Epithelial Enrich (Dynal Biotech, Oslo, Norway), 
StemSep™ (StemCell Technologies, Inc., Vancouver, BC), and RosetteSep (StemCell 
Technologies). A skilled artisan will recognize that other methodologies and kits may 
also be used to enrich or positively select desired cell populations. Dynabeads® 

10 Epithelial Enrich contains magnetic beads coated with mAbs specific for two 
glycoprotein membrane antigens expressed on normal and neoplastic epithelial tissues. 
The coated beads may be added to a sample and the sample then applied to a magnet, 
thereby capturing the cells bound to the beads. The imwanted cells are washed away 
and the magnetically isolated cells eluted from the beads and used in further analyses. 

15 RosetteSep can be used to enrich cells directly fi-om a blood sample and 

consists of a cocktail of tetrameric antibodies that targets a variety of unwanted cells 
and crosslinks them to glycophorin A on red blood cells (RBC) present in the sample, 
forming rosettes. When centrifiiged over Ficoll, targeted cells pellet along with the fi-ee 
RBC. The combination of antibodies in the depletion cocktail determines which cells 

20 will be removed and consequently which cells will be recovered. Antibodies that are 
available include, but are not limited to: CD2, CD3, CD4, CDS, CDS, CDIO, CDllb, 
CD14, CD15, CD16, CD19, CD20, CD24, CD25, CD29, CD33, CD34, CD36, CD38, 
CD41, CD45, CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE, and TCRap. 

Additionally, it is contemplated in the present invention that mAbs 

25 specific for Ixmg tumor antigens can be generated and used in a similar manner. For 
example, mAbs that bind to tumor-specific cell surface antigens may be conjugated to 
magnetic beads, or formulated in a tetrameric antibody complex, and used to enrich or 
positively select metastatic lung tumor cells fi-om a sample. Once a sample is enriched 
or positively selected, cells may be lysed and RNA isolated. RNA may then be 

30 subjected to RT-PCR analysis using lung tumor-specific primers in a real-time PGR 
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assay as described herein. Qae skilled in the art recognize that enriched or selected 
populations of ceUs may be analyzed by other methods (eg. /„ situ hybridization or 
flow cytometry). 

In another embodiment, the compositions described herein may be used 
asmarkersfortheprogt^ionofcancer. In this embodiment, assays as described above 
for the diagnosis of a cancer may be perfonned over time, and the change in the level of 
^cfve poIypeptide(s) or poly„ucleotide(s) evaluated. For example, the assays may be 
perfonned every 24-72 hours for a period of 6 months to I year, and thereafter 
perfonned as needed. ^ general, a cancer is progressing in those patients in whom the 
10 level of polypeptide or polynucleotide detected increases over time. In contrast the 
c^cer as not progressing when the level of reactive polj^ptide or polynucleotide either 
remains constant or decreases with time. 

Certain in vivo diagnostic assays may be perfonned directly on a tumor 
One such assay involves contacting tumor cells with a binding agent. Tk. bound 

15 agent may then be detected directly or indirectly via a reporter group. Such 

binding agents may also be used in histological appUcations. Alternatively 
polynucleotide probes may be used witiim such q)plications. 

As noted above, to unprove sensitivity, multiple tiamor protein markers 
-aybeassayedwitinnagivensample. It ^viU be apparent that binding agents specific 
20 for differ^t proteins provided herein may be combined witirin a single assay. Further 
multiple primers or probes may be used concunentiy. The selection of tumor protein 
marker, may be based on routine experiments to detennine combinations that results in 
optimal sensitivity, m addition, or altenaatively, assays for tumor proteins provided 
herem may be combined with assays for other known tumor antigens. 
25 The present invention further provides kits for use within any of tiie 

above diagnostic methods. Such kits tj^ically comprise two or more components 
necessary for perfonning a diagnostic assay. Components may be compounds, reagents 
contamers and/or equipment. For example, one container v^dtiun a kit may contain J 
monoclonal antibody or fbgment tiiereof ti.at specifically binds to a tiunor protein 
30 Such antibodies or fragments may be provided attached to a support material as 
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described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain 
a detection reagent as described above that contains a reporter group suitable for direct 
or indirect detection of antibody binding. 
5 Alternatively, a kit may be designed to detect the level of mKNA 

encoding a tumor protein in a biological sample. Such kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 
polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for 
example, within a PCR or hybridization assay. Additional components that may be 
10 present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a timior protein. 

The following Examples are offered by way of illustration and not by 
way of limitation. 
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EXAMPLRS 



10 



EXAMPLE 1 

ISOLATION AND 0HAFArTPFT7^Tf 0N0FcDNA SF.OTTPN CES KNmnTMO 
LUNG TIJMO R POr.YPBTmnt?^ 
TWs example illustrates the isolation of cDNA molecules encoding lung 
tumor-specific polypq)tides from lung tumor cDNA libraries. 

A. ISOLATION OF cDNA SEQUENCES FROM A LUNG SQUAMOUS CELL 
CARCINOMA LIBRARY 

A human lung squamous cell carcinoma cDNA expression library was 
constructed from poly A^ RNA from a pool of two patient tissues using a Superscript 
Plasmid System for cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies 
Gaithersburg, MD) following the manufactu,.rt protocol Specifically. lung carcinoma 
tissues were homogenized with polytron (Kinematica, SAvitzerland) and total RNA was 
15 extracted using Trizol reagent (BRL Ufe Technologies) as directed by the manufacturer 
The poly A^ RNA was then purified using an oligo dT cellulose column as described in 
Sambrook etal.. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories. Cold Spring Harbor, NY. 1989. First-strand cDNA was synthesized using 
the NotI/01igo-dT18 primer. Double-shaded cDNA was synthesized, ligated with 
20 BstXI^Rl adaptor. Onvitrogen. San Diego, CA) and digested with Notl. Following 
s,ze fractionation with cDNA size fractionation columns (BRL Life Technologies) the 
CDNA was ligated into the BstXI/NotI site of pcDNA3.1 (tovitrogen) and transfonned 
mto ElectroMax E. coli DHIOB cells (BRL Life Technologies) by electropor^tion. 

Using the same procedure, a nonnal human lung cDNA expression 
25 library was prepared from a pool of four tissue specimens. ITie cDNA libraries were 
characterized by determining the number of independent colonies, the percentage of 
clones that carried insert, the average insert size and by sequence analysis. n,e lung 
squamous cell carcinoma library contained 2.7 x 10« independent colonies, with 100% 
of clones having an insert and the average insert size being 2100 base pairs. TT,e normal 
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lung cDNA library contained 1.4 x 10 independent colonies, with 90% of clones 
having inserts and the average insert size being 1800 base pairs. For both libraries, 
sequence analysis showed that the majority of clones had a full length cDNA sequence 
and were synthesized from mRNA 
5 cDNA library subtraction was performed using the above lung squamous 

cell carcinoma and normal lung cDNA libraries, as described by Hara et al {Bloody 
5^:189-199, 1994) with some modifications. Specifically, a lung squamous cell 
carcinoma-specific subtracted cDNA library was generated as follows. Normal tissue 
cDNA library (80 p.g) was digested with BamHI and Xhol, followed by a filling-m 

10 reaction with DNA polymerase Klenow fragment. After phenol-chloroform extraction 
and ethanol precipitation, the DNA was dissolved in 133 jil of H2O, heat-denatured and 
mixed with 133 pJ (133 pg) of Photoprobe biotin (Vector Laboratories, Burlingame, 
C A). As recommended by the manufacturer, the resulting mixture was irradiated with a 
270 W sunlamp on ice for 20 minutes. Additional Photoprobe biotin (67 |il) was added 

15 and the biotinylation reaction was repeated. After extraction with butanol five times, 
the DNA was efhanol-precipitated and dissolved in 23 pj H2O to form the driver DNA. 

To form the tracer DNA, 10 jig lung squamous cell carcinoma cDNA 
library was digested with Not! and Spel, phenol chloroform extracted and passed 
through Chroma spin-400 colunms (Clontech, Palo Alto, CA). typically, 5 pg of 

20 cDNA was recovered after the sizing column. Following ethanol precipitation, the 
tracer DNA was dissolved in 5 jil H2O. Tracer DNA was mixed with 15 pi driver DNA 
and 20 jil of 2 X hybridization buffer (1.5 M NaCl/10 mM EDTA/50 mM HEPES pH 
7.5/0.2% sodium dodecyl sulfate), overlaid with mineral oil, and heat-denatured 
completely. The sample was immediately transferred into a 68 water bath and 

25 incubated for 20 hours (long hybridization [LH]). The reaction mixture was then 
subjected to a streptavidin treatment followed by phenol/chloroform extraction. This 
process was repeated three more times. Subtracted DNA was precipitated, dissolved in 
12 nl H2O, mixed with 8 \j1 driver DNA and 20 jil of 2 x hybridization buffer, and 
subjected to a hybridization at 68 for 2 hours (short hybridization [SH]). After 

30 removal of biotinylated double-stranded DNA, subtracted cDNA was ligated into 
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NoaSpel site of .hlomnph.„ie„l pBCST (Strabgene, U Jdh, CA) and 

Wormed i«o ElWroMax E. DHIOB cells by elcoto,p«,aon to geo««e a l™^ 
squamous oeU carcinoma specife sutlncted cDNA Ubn., O^h ate to as 

' lung subtraction I"). 

5 ^ ^l"""" «11 "dnoma specific subuaced cDNA 

btao. (re£e„«i to as "lu^ jr) ™s g^aerated iu a sMa, to lung 

subt«.„„ Ub,^ except U« eigh, lta,ue«ly «»ve,ed genes fton, lung sub».«io„ 
I »e« .nchded in the driver DNA, and 24,000 independent clones were re«,v«ed. 

■^°°™''™*'«"*^W«I>NA libraries, plasmid DNA wasimpared 
flom 320 nKlepe^ien, clones, randomly picked Son, the subbacted lung sqnantou, cell 

can^maspecifcbbraries. R^Presentadve cDNA clones were itete ch^^teri^ by 
DNA sconcing ™,b a Peridn El„./App,i.d Biosystems Divisic„ Automated 
Sequencer Mc«.el 373A and,or Model 377 (Foster City, CA). lie cDNA se,«„c.s for 
sixty .solated clones are provided in SEQ ID NO: Tlese sequences ™,e 

compared to known sequences in the gene bank using fl. EMBL .ad Oenftn* 
datatases (release 96). No si^oant hontologies were fotarf to 'the sequels 
provd.dinSEQlDNO:2.3.19.38and46. The sequences of SEQ ID NO- 1 M 10 
.3. .5, 17. ,8, 2(.27. 29. 30. 32. 34-37. 39-45. 47-49. 51. 5^ M. 55 and 57*59 
found to show some homology to previously identilied expressed sequence tags (ESTs) 
20 Tbe sequences of SEQ m NO: 9. 28. 31 a.^ 33 were found to show sc«te homology ,o 
P«-iously identified «,„-hunan gene sequences and the sequences of SEQ ID NO- 4 
5. 14. 50. 53. 56 and 60 were found to show some homology to gene sequences" 
previously identified in humans. 

'*'='*^™P'°<*^'«'te»»iW above was repeated using the above 
25 lung »^ous cell carcmoma cDNA lihary as the tracer DNA. and the above ncamal 
lung W CDNA lib^ry ^ a cDNA library fiom normal Uver and hear, (constru«ed 
ton apco, of one sample of each tissue as described above), plus twenty other cDNA 
clones that were frequently tecovacd in lung subuactions I ami fl. as the driver DNA 
0™.g „,). The nomial liver and heari cDNA libraty contained 1.76 x I0« 

mdependem colonies, with 100% of clones havrng insem and the average insert size 
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being 1600 base pairs. Ten additional clones were isolated (SEQ ID NO: 61-70). 
Comparison of these cDNA sequences with those in the gene bank as described above, 
revealed no significant homologies to the sequences provided in SEQ ID NO: 62 and 
67. The sequences of SEQ ID NO: 61, 63-66, 68 and 69 were found to show some 
5 homology to previously isolated ESTs and the sequence provided in SEQ ID NO: 70 
was found to show some homology to a previously identified rat gene. 

In further studies, the subtraction procedure described above was 
repeated using the above lung squamous cell carcinoma cDNA library as the tracer 
DNA, and a cDNA library from a pool of normal lung, kidney, colon, pancreas, brain, 

10 resting PBMC, heart, skin and esophagus as the driver DNA, with esophagus cDNAs 
making up one third of the driver material. Since esophagus is enriched in normal 
epithelial cells, including differentiated squamous cells, this procedure is likely to 
enrich genes that are tumor specific rather than tissues specific. The cDNA sequences 
of 48 clones determined m this subtraction are provided in SEQ ID NO: 177-224, The 

15 sequences of SEQ ID NO: 177, 178, 180, 181, 183, 187, 192, 195-197, 208, 211, 212, 
215, 216, 218 and 219 showed some homology to previously identified genes. The 
sequences of SEQ ID NO: 179, 182, 184-186, 188-191, 193, 194, 198-207, 209 210, 
213, 214, 217, 220 and 224 showed some homology to previously determined ESTs. 
The sequence of SEQ ID NO: 221-223 showed no homology to airy previously 

20 determined sequence. 

B. ISOLATION OF cDNA SEQUENCES FROM A LUNG 
ADENOCARCINOMA LIBRARY 

A himian limg adenocarcinoma cDNA expression library was 
constructed as described above. The library contained 3.2 x 10^ independent colonies, 
25 with 100% of clones having an insert and the average insert size being 1500 base pairs. 
Library subtraction was performed as described above usmg the normal lung and 
normal liver and heart cDNA expression libraries described above as the driver DNA. 
Twenty-six hundred independent clones were recovered. 
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Initial cDNA sequence analysis from 100 independent clones revealed 
many ribosomal protein genes. The cDNA sequences for fifteen clones isolated in this 
subtraction are provided in SEQ ID NO: 71-86. Comparison of these sequences with 
those m the gene bank as described above revealed no significant homologies to the 
5 sequence provided in SEQ ID NO: 84. The sequences of SEQ ID NO: 71, 73, 74, 77 
78 and 80-82 were found to show some homology to previously isolated ESTs,'and the 
sequences of SEQ ID NO: 72, 75, 76, 79. 83 and 85 we found to show some 
homology to previously identified human genes. 

to further studies, a cDNA library (referred to as mets3616A) was 
10 constructed from a metastatic lung adenocarcinoma. The detennined cDNA sequences 
of 25 clones sequenced at random from this libraiy are provided in SEQ ID NO- 255- 
279. The mets3616A cDNA library was subtracted against a cDNA library pi^ared 
from a pool of normal lung, liver, pancreas, skin, kidney, bram and resting PBMC To 
increase the q^cificity of the ^btraction, the driver was spiked with genes that were 
15 determined to be most abundant in the mets36I6A cDNA library, such as EFl-alpha 
integrin-beta and anticoagulant protein PP4, as well as with cDNAs that were 
previously found to. be differentially expressed in subtracted lung adenocarcinoma 
CDNA libraries. The detennined cDNA sequences of 51 clones isolated from ti.e 
subtracted Kbrary (referred to as mets3616A-Sl) are provided in SEQ ID NO: 280-330. 
20 Comparison of the sequences of SEQ ID NO: 255-330 with those in the 

public databases revealed no significant homologies to the sequences of SEQ ID NO- 
255-258. 260. 262-264, 270. 272, 275. 276. 279, 281, 287, 291, 296. 300 and 310 The 
sequences of SEQ ID NO: 259, 261, 265-269, 271, 273. 274, 277, 278. 282-285 288- 
290. 292. 294, 297-299, 301, 303-309, 313, 314. 316, 320-324 and 326-330 showed 
25 some homology to previously identified gene sequences, while the sequences of SEQ ID 
NO: 280, 286, 293, 302, 310, 312, 3,5, 317-319 and 325 showed some homology to 
previously isolated expressed sequence tags (ESTs). 
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EXAMPLE 2 

DETERMINATION OF TISSUE SPECIFICITY OF LUNG TUMOR POLYPEPTIDES 
Using gene specific primers, mRNA expression levels for seven 
representative lung tumor polypeptides described in Example 1 were examined in a 
5 variety of normal and tumor tissues using RT-PCR. 

Briefly, total RNA was extracted from a variety of normal and tumor 
tissues using Trizol reagent as described above. First strand synthesis was carried out 
using 2 \ig of total RNA with Superscript II reverse transcriptase (BRL Life 
Technologies) at 42 ^C for one hour. The cDNA was then amplified by PGR with gene- 

10 specific primers. To ensure the semi-quantitative nature of the RT-PCR, P-actin was 
used as an internal control for each of the tissues examined. 1 |il of 1:30 dilution of 
cDNA was employed to enable the lineai* range amplification of the P-actin template 
and was sensitive enough to reflect the differences in the initial copy numbers. Using 
these conditions, the P-actin levels were determined for each reverse transcription 

IS reaction fi-om each tissue. DNA contanwiation was minimized by DNase treatment and 
by assuring a negative PCR result when using first strand cDNA that was prepared 
without adding reverse transcriptase. 

mRNA Expression levels were examined in five difierent types of tumor 
tissue (limg squamous cell carcinoma from 3 patients, lung adenocarcinoma, colon 

20 tumor from 2 patients, breast tumor and prostate tumor), and thirteen different normal 
tissues (lung from 4 donors, prostate, brain, kidney, liver, ovary, skeletal muscle, skin, 
small intestine, stomach, myocardiimi, retina and testes). Using a 10-fold amount of 
cDNA, the antigen LST-Sl-90 (SEQ ID NO: 3) was found to be expressed at high 
levels in lung squamous ceil carcinoma and in breast tumor, and at low to undetectable 

25 levels in the other tissues examined. 

The antigen LST-S2-68 (SEQ ID NO; 15) appears to be specific to lung 
and breast tumor, however, expression was also detected in normal kidney. Antigens 
LST-SM69 (SEQ ID NO: 6) and LST-Sl-133 (SEQ ID NO: 5) appear to be very 
abundant in lung tissues (both normal and tumor), with the expression of these two 

30 genes being decreased in most of the normal tissues tested. Both LST-S 1-169 and LST- 
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Sl-133 were also expressed in breast and colon tumors. Antigens LST.Sl-6 (SEQ ID 
NO: 7) and LST-S2-I2-5F (SEQ JD NO: 47) did not show tumor or tissue specific 
expression, with the expression of LST-Sl-28 being rare and only detectable in a few 

tissues. The antigenLST-S3-7(SEQID NO: 63)showedlungandbrea^tumorspecific 
expression, with its message only being detected in normal testes when the PGR was 
performed for 30 cycles. Lower level expression was detected in some normal tissues 
when the cycle number was increased to 35. Antigen LST-S3-13 (SEQ ID NO- 66) was 
found to be expressed in 3 out of 4 lung tumor., one breast tumor and both colon tumor 
samples. Its expression in nonnal tissues was lower compared to tumors, and was only 
10 detected in 1 out of 4 normal lung tissues and in normal tissues from kidney, ovary and 
retma. Expression of antigens LST-S3-4 (SEQ JD NO: 62) and LST-S3-14 (SEQ ID 
NO: 67) was rare and did not show any tissue or tumor specificity. Consistent with 
Northern blot analyses, the RT-PCR results on antigen LAT-Sl-A-lOA (SEQ ID NO- 
78) suggested that its expression is high in lung, colon, stomach and small intestine 
15 tissues, including lung and colon tumors, whereas its expression was low or 
undetectable in other tissues. 

A tow of 2(XC cDNA ftagmeote isolated ta lung sal*m*kas I n and 
in, descnbed abov^ ^ colony PCR amplified and d^i, ntf!NA expression levels in 
tag «nnor, nornd ^ ^ ^ ^ «.d Omor tissues «^ detennined 
20 miwana, technology (SyMcni. Pate Alto, CA). Briefly, to PCR amplification 

P«.d»c« we do«ed onto sUdes in an array fonna^ ^ ^ ^^^^^ ^ 

unrqne location in the array. nJWA was extracted ft«n fl« tissue sample to be tested 
revetse tiansaibed. and fluoreseent^abeled cDNA ptobes were genenited TOe' 
nncoarrays were pK,bed with ti,. labeled cDNA probes, the sUdes scanned and 
25 fluorescence intensity's measured. This intensity correlates witi, tire hybridization 
mtensity. Seventeen non-redundant cDNA clones showed over-expt^ssion in lung 
squamous tiunors, witi, expression in normal tissues tested flung, skin, lymph node 
colon, liver, pancreas, breast, heart, bou. marrow, large intestine, kidney, stomach, 
hr«„. small intestine, bladder and salivary gland) being e,ti,er undetectable, or lolfcld 
less compared to lung squamous timrors. Tbe detenmned cDNA sequences for tire 
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clone L513S are provided in SEQ ID NO: 87 and 88; those for L514S are provided in 
SEQ ID NO: 89 and 90; those for L516S in SEQ ID NO: 91 and 92; that for L517S in 
SEQ ID NO: 93; that for L519S in SEQ ID NO: 94; those for L520S in SEQ ID NO: 95 
and 96; those for L521S in SEQ ID NO: 97 and 98; that for L522S in SEQ ID NO: 99; 
5 that for L523S in SEQ ID NO: 100; that for L524S m SEQ ID NO: 101; that for L525S 
in SEQ ID NO: 102; that for L526S in SEQ ID NO: 103; that for L527S in SEQ ID NO: 
104; that for L528S in SEQ ID NO: 105; that for L529S in SEQ ID NO: 106; and those 
for L530S in SEQ ID NO: 107 and 108. Additionally, the full-length cDNA sequence 
for L530S is provided in SEQ ID NO: 151, with the corresponding amino acid sequence 

10 being provided in SEQ ID NO: 152. L530S shows homology to a splice variant of a 
p53 tumor suppressor homologue, p63. The cDNA sequences of 7 known isoforms of 
p63 are provided in SEQ ID NO: 331-337, with the corresponding amino acid 
sequences being provided in SEQ ID NO: 338-344, respectively. 

Due to polymorphisms, the clone L531S appears to have two forms. A 

15 first deteraiined full-length cDNA sequence for L531S is provided in SEQ ID NO: 109, 
with the corresponding amino acid sequence being provided in SEQ ID NO: 110. A 
second determined full-length cDNA sequence for L531S is provided in SEQ ID NO: 
111, with the corresponding amino acid sequence bemg provided in SEQ ID NO: 112. 
The sequence of SEQ ID NO: 1 1 1 is identical to that of SEQ ID NO: 109, except that it 

20 contains a 27 bp insertion. Similarly, L514S has two alternatively spliced forms; the 
first variant cDNA is listed as SEQ ID NO: 153, with the corresponding amino acid 
sequence being provided in SEQ ID NO: 155. The full-length cDNA for the second 
variant form of L514S is provided in SEQ ID NO: 154, with the corresponding amino 
acid sequence being provided in SEQ ID NO: 156. 

25 Full length cloning for L524S (SEQ ID NO: 101) yielded two variants 

(SEQ ID NO: 163 and 164) with the corresponding amino acid sequences of SEQ ID 
NO: 165 and 166, respectively. Both variants have been shown to encode parathyroid 
hormone-related peptide. 

Attempts to isolate the fijlHength cDNA for L519S, resulted m the 

30 isolation of the extended cDNA sequence provided in SEQ ID NO: 1 73, which contains 
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a po.»M ope» fading Th. «^ „u «pe„„e «Koded by «,«e„ce of 

SEQ ID NO: 173 is provided ta SEQ ID NO: 174. AddWo^ly. to fuIHengfc cDNA 
secuencc for *e clo« of SEQ m NO: 1<» (b,„™ ^ a known gene, is 

provided in SEQ ID NO: 175, ^ o„^^ 
5 ^vided in SEQ ID NO: .76. fa «rt,er andies. a m-t^^ oDNA sequence for 
L523S was rsolared fta a L523S-p««iv. ta»r cDNA library by PCR a^pliflearion 
o^t gene sf^Bo primers designed Bon, tie sequence of SEQ ID NO- 175 n,e 
detenmned Ml-lengU, cDNA sequence is provided in SEQ ID NO: 347 The amino 
acid se,ne«:e e«xxled by sequence is provided in SEQ ID NO: 348. nris protein 
■0 sequel* dife, «om *e previously published protein sequence at ,« anrino acid 
positions, namely at positions 158 and 410. 

Comparison of the sequences of L514S and L531S (SEQ ED NO- 87 and 
88, and 109, r^ectively) with those in the gene bank, as described above, .wealed no 
significant homologies to known sequences. Th. sequences of L513S, L516S, L517S 
15 L519S, L520S and L530S (SEQ ID NO: 87 and 88, 91 and 92, 93, 94, 95 and 96 107 
and 108, respectively) were found to show som. homology to previously identified 
ESTs. Tke sequences of L521S, L522S, L523S, L524S. L525S, L526S, L527S. L528S 
and L529S (SEQ ID NO: 97 and 98, 99, 99, 101, 102. ,03. ,04, ,05, and ,06 
respectively) were foum, to represent knoMO. genes, lie detennined fiUMength cDNA 
20 sequence for L520S is provided in SEQ ID NO: 113. with the corresponding amino 
acid sequence being provided in SEQ ID NO: 1,4. Subsequent microairay analysis 
showed L520S to be ove.xpr.ssed in breast t^ors in addition to lung squamous 



tumors. 



Further analysis demonstrated that L529S (SHQ ID NO: 106 »,d 1 15) 
25 L525S (SEQ ID NO: .02 and ,20) and L527S (SEQ m NO: 104, a. eytoskeletai 
componeMs and poienflally squamous cell specific proteins. L529S is c«m«dn 26 a 
M. jnncdon pr«ein. It was found to be highly expressed in one lung squamous *m,lr 
-teed to as W8T, and moderately over-expressed in .„„ others. However, lowei 
level expression of comrexin 26 is also detectable in nomral skin, colon. Hver and 
stomach. Tie over^expression of connexin 26 in some breast tnmora has bea reported 
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and a mutated form of L529S may result in over-expression in lung tumors. L525S is 
plakophilin 1, a desmosomal protein found in plaque-bearing adhering junctions of the 
skin. Expression levels for L525S mRNA was highly elevated in three out of four lung 
squamous tumors tested, and in normal skin. L527S has been identified as keratin 6 
5 isoform, type II 58 Kd keratin and cytokeratin 13, and shows over-expression in 
squamous tumors and low expression in normal skin, breast and colon tissues. Keratin 
and keratin-related genes have been extensively documented as potential markers for 
lung cancer including CYFRA2.1 (Pastor, A., et al, Eur. Respir. J., 10:603-609, 1997). 
L513S (SEQ ID NO: 87 and 88) shows moderate over-expression in several tumor 
10 tissues tested, and encodes a protein that was first isolated as a pemphigus vulgaris 
antigen. 

L520S (SEQ ID NO: 95 and 96) and L521S (SEQ ID NO: 97 and 98) are 
highly expressed in lung squamous tumors, with L520S being up-regulated in normal 
salivary gland and L521S being over-expressed in normal skin. Both belong to a family 

15 of small proline rich proteins and represent markers for fiilly differentiated squamous 
cells. L521 S has been described as a specific marker for limg squamous tumor (Hu, R., 
et al. Lung Cancer, 20:25-30, 1998). L515S (SEQ ID NO: 162) encodes IGF-P2 and 
L516S is an aldose reductase homologue. Both are moderately expressed in lung 
squamous tumors and in normal colon. Notably, L516S (SEQ ID NO: 91 and 92) is up- 

20 regulated in metastatic tumors but not primary lung adenocarcinoma, an indication of its 
potential role in metatasis and a potential prognostic marker. L522S (SEQ ID NO: 99) 
is moderately over-expressed in lung squamous tumors with minimum expression in 
normal tissues. L522S has been shown to belong to a class IV alcohol dehydrogenase, 
ADH7, and its expression profile suggests it is a squamous cell specific antigen. L523S 

25 (SEQ ID NO: 100) is moderately over-expressed in lung squamous tumor, human 
pancreatic cancer cell lines and pancreatic cancer tissues, suggesting this gene may be a 
shared antigen between pancreatic and lung squamous cell cancer. 

L524S (SEQ ID NO: 101) is over-expressed in the majority of squamous 
tumors tested and is homologous with parathyroid hormone-related peptide (PTHrP), 

30 which is best known to cause humoral hypercalcaemia associated with malignant 
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tumors such as leukemia, prostate and breast cancer. It is also beUeved fliat PTHrP is 
most commonly associated with squamous carcinoma of lung and laiely with lung 
adenocarcinoma (Davidson. L.A., et al. J. Pathol, 178: 398-401, 1996). L528S (SEQ 
ID NO: 105) is highly over-expressed in two lung squamous tumors with moderate 
5 expression in two other squamous tumors, one lung adenocarcinoma and some noimal 
tissues, including skin, lymph nodes, heart, stomach and lung. It encodes the NMB 
gene that is similar to the precursor of melanocyte specific gene Pmell7, which is 
reported to be preferentially expressed in low-metastatic potential melanoma cell lines. 
This suggests that L528S may be a shared antigen in both melanoma and lung squamous 
10 cellcarcinoma. L526S (SEQ ID NO: 103) was overexpressed in all lung squamous cell 
tumor tissues tested and has been shown to share homology with a gene (ATM) in 
which a mutation causes ataxia telangiectasia, a genetic disorder in humans causing a 
predisposition to cancer, among other symptoms. ATM encodes a protein that activates 
a p53 mediated ceU-cycle checkpoint through direct binding and phosphorylation of the 
15 p53 molecule. Approximately 40% of lung cancers are associated with p53 mutations, 
and it is speculated that over-expression of ATM is a result of compensation for loss of 
p53 fimction, but it is unknown whether over-expression is the cause of result of lung 
squamous ceU carcinoma. Additionally, expression of L526S (ATM) is also detected in 
ametastatic but not lung adenocarcinoma, suggesting a role in metastasis. 
20 Expression of L523S (SEQ ID NO: 175), was examined by real time 

RT-PCR as described above. In a first study using a panel of lung squamous tumors, 
L523S was found to be expressed in 4/7 lung squamous tumors, 2/3 head and neck 
squamous tumors and 2/2 lung adenocarcinomas, with low level expression being 
observed in skeletal muscle, soft palate and tonsil. In a second study using a lung 
25 adenocarcinoma panel, expression of L523S was observed in 4/9 primary 
adenocarcinomas, 2/2 lung pleural effusions, 1/1 metastatic lung adenocarcinomas and 
2/2 lung squamous tumors, with little expression being observed in normal tissues. 

Expression of L523S in lung tumors and various normal tissues was also 
examined by Northern blot analysis, using standard techniques. In a first study, L523S 
30 was found to be expressed in a number of lung adenocarcinomas and squamous cell 
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carcinomas, as well as normal tonsil. No expression was observed in normal lung. In a 
second study using a normal tissue blot (referred to as HB-12) from Clontech, no 
expression was observed in brain, skeletal muscle, colon, thymus, spleen, kidney, liver, 
small intestine, limg or PBMC, although there was strong expression in placenta, 

5 

EXAMPLES 

ISOLATION AND CHARACTERIZATION OF LUNG TUMOR POLYFEFITDES 
BY PCR-BASED SUBTRACTION 
Eight hundred and fifty seven clones from a cDNA subtraction library, 
10 containing cDNA from a pool of two human lung squamous tumors subtracted against 
eight normal human tissue cDNAs including lung, PBMC, brain, heart, kidney, liver, 
pancreas, and skin, (Clontech, Palo Alto, CA) were derived and submitted to a first 
round of PCR amplification. This library was subjected to a second roimd of PCR 
amplification, following the manufacturer's protocol. The resulting cDNA fragments 
15 were subcloned into the P7-Adv vector (Clontech, Palo Alto, CA) and transformed into 
DH5a £. coli (Gibco, BRL). DNA was isolated from independent clones and 
sequenced using a Perkin Elmer/Applied Biosystems Division Automated Sequencer 
Model 373A. 

One hundred and sixty two positive clones were sequenced. Comparison 
20 of the DNA sequences of these clones with those in the EMBL and GenBank databases, 
as described above, revealed no significant homologies to 13 of these clones, hereinafter 
referred to as Contigs 13, 16, 17, 19, 22, 24, 29, 47, 49, 56-59. The determined cDNA 
sequences for these clones are provided in SEQ ID NO: 125, 127-129, 131-133, 142, 
144, 148-150, and 1 57, respectively. Contigs 1,3-5,7-10, 12, 11, 15,20,31,33,38,39, 
25 41, 43, 44, 45, 48, 50, 53, 54 (SEQ ID NO: 115-124, 126, 130, 134-141, 143, 145-147, 
respectively) were found to show some degree of homology to previously identified 
DNA sequences. Contig 57 (SEQ ID NO: 149) was found to represent the clone L519S 
(SEQ ID NO: 94) disclosed m US. Patent Application No. 09/123,912, filed July 27, 
1998. To the best of the inventors* knowledge, none of these sequences have been 
30 previously shown to be diflferentially over-expressed in lung tumors. 
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mRNA expression levels for representative clones in lung tumor tissues, 
nonnal lung tissues (n=4). resting PBMC, salivary gland, heart, stomach, lymph nodes, 
skeletal muscle, soft palate, small intestme, large intestine, bronchial, bladder, tonsil, 
kidney, esophagus, bone marrow, colon, adrenal gland, pancreas, and skin (all derived 
5 from human) were detennined by RT-PCR as described above. Expression levels using 
midoarray technology, as described above, were examined in one sample of each tissue 
type unless otherwise indicated. 

Contig 3 (SEQ ID NO: 1 1 6) was found to be highly expressed in all head 
and neck squamous cell tumors tested (17/17), and expressed in the majoxily (8/12) of 
10 lung squamous tumors, (high expression in 7/12, moderate in 2/12, and low in 2/12), 
while showing negative expression for 2/4 nonnal lung tissues and low expression in 
the remaining two samples. Contig 3 showed moderate expression in skm and soft 
palate, and lowered expression levels in resting PBMC, large intestine, salivary gland, 
tonsil, pancreas, esophagus, and colon. Contig 1 1 (SEQ ID NO: 124) was found to be 
15 expressed in all head and neck squamous cell tumors tested (17/17), with high levels of 
expression being seen in 14/17 tumors, and moderately levels of expression being seen 
in 3/17 tumors. Additionally, high expression was seen in 3/12 hmg squamous tumors 
and moderate expression in 4/12 lung squamous tumors. Contig 1 1 was negative for 3/4 
normal lung samples, with the remaining sample having only low expression. Contig 
20 1 1 showed low to moderate reactivity to salivary gland, soft palate, bladder, tonsil, skin, 
esophagus, and large intestine. Contig 13 (SEQ ID NO: 125) was found to be expressed 
in all head and neck squamous cell tumors tested (17/17), with high expression in 
12/17, and moderate expression in 5/17. Contig 13 was expressed in 7/12 lung 
squamous tumors, with high expression in 4/12 and moderate expression in three 
25 samples. Analysis of normal lung samples showed negative expression for 2/4 and low 
to moderate expression in the remaining two samples. Contig 13 showed low to 
moderate reactivity to resting PBMC, salivary gland, bladder, pancreas, tonsil, skin, 
esophagus, and large intestine, as well as high expression in soft palate. Subsequent 
foU-length cloning efforts revealed that contig 13 (also known as L761P) maps to the 3' 
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untranslated region of the liSeclOp gene. The full-length sequence for this gene is set 
forth in SEQ ID NO: 368, and encodes the protein set forth in SEQ ID NO: 369. 

Contig 16 (SEQ ID NO: 127) was found to be moderately expressed in 
several head and neck squamous cell tumors (6/17) and one lung squamous tumor, 
5 while showing no expression in any normal lung samples tested. Contig 16 showed low 
reactivity to resting PBMC, large intestine, skin, salivary gland, and soft palate. Contig 
17 (SEQ ID NO: 128) was shown to be expressed in all head and neck squamous cell 
tumors tested (17/17) (highly expressed in 5/17, and moderately expressed in 12/17). 
Determination of expression levels in lung squamous tumors showed one tumor sample 

10 with high expression and 3/12 with moderate levels. Contig 17 was negative for 2/4 
normal lung samples, with the remaining samples having only low expression. 
Additionally, low level expression was foimd in esophagus and soft palate. Contig 19 
(SEQ ID NO: 129) was found to be expressed in most head and neck squamous cell 
tumors tested (11/17); with two samples having high expression levels, 6/17 showing 

15 moderate expression, and low expression being found in 3/17. Testing in lung 
squamous tmnors revealed only moderate expression in 3/12 samples. Expression 
levels in 2/4 of normal lung samples were negative, the two other samples having only 
low expression. Contig 19 showed low expression levels in esophagus, resting PBMC, 
salivary gland, bladder, soft palate and pancreas. 

20 Contig 22 (SEQ ID NO: 131), was shown to be expressed in most head 

and neck squamous cell tumors tested (13/17) with high expression in four of these 
samples, moderate expression in 6/17, and low expression in 3/17. Expression levels in 
lung squamous txmiors were found to be moderate to high for 3/12 tissues tested, v^th 
negative expression in two normal lung samples and low expression in two other 

25 samples (n=4). Contig 22 showed low expression in skin, salivary gland and soft 
palate. Similarly, Contig 24 (SEQ ID NO: 1 32) was found to be expressed in most head 
and neck squamous cell tumors tested (13/17) with high expression in three of these 
samples, moderate expression in 6/17, and low expression in 4/17. Expression levels in 
lung squamous tumors were found to be moderate to high for 3/12 tissues tested, with 

30 negative expression for three normal lung samples and low expression in one sample 
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(n=4). Contig 24 showed low expression in skin, saHvaiy gland and soft palate. 
Contig 29 (SEQ ID NO: 133) was expressed in nearly all head and neck squamous cell 
tumors tested (16/17): highly expressed in 4/17, moderately expressed in 11/17, with 
low expression in one sample. Also, it was moderately expressed in 3/12 lung 
5 squamous tumors, while being negative for 2/4 normal lung samples. Contig 29 
showed low to 'moderate expression in large mtestine, skin, salivary gland, pancreas, 
tonsil, heart and soft palate. Contig 47 (SEQ ID NO: 142) was expressed in most head 
and neck squamous ceU tumors tested (12/17): moderate expression in 10/17, and low 
expression in two samples. In lung squamous tumors, it was highly expressed in one 
10 sample and moderately expressed in two others (n=13). Contig 47 was negative for 2/4 
normal lung samples, with the remaining two samples having moderate expression. 
Also, Contig 47 showed moderate expression in large intestine, and pancreas, and low 
expression in skin, salivary gland, soft palate, stomach, bladder, resting PBMC, and 
tonsil. 

C°°«g 48 (SEQ ID NO: 143) was expressed in all head and neck 
squamous cell tumors tested (17/17): highly expressed in 8/17 and moderately 
expressed in 7/17, with low expression in two samples. Expression levels in lung 
squamous tumors were high to moderate in three samples (n=13). Contig 48 was 
negative for one out of four normal Imig samples, the remaining showing low or 

20 moderate expressioa Contig 48 showed moderate expression in soft palate, large 
intestine, pancreas, and bladder, and low expression in esophagus, salivary gland, 
resting PBMC, and heart. Contig 49 (SEQ ID NO: 144) was expressed at low to 
moderate levels in 6/17 head and neck squamous cell tumors tested. Expression levels 
in lung squamous tumors were moderate in three samples (n=13). Contig 49 was 

25 negative for 2/4 normal lung samples, the remaining samples showing low expression. 
Moderate expression levels in skin, salivary gland, large intestine, pancreas, bladder and 
resting PBMC were shown, as well as low expression in soft palate, lymph nodes, and 
tonsil. Contig 56 (SEQ ID NO: 148) was expressed in low to moderate levels in 3/17 
head and neck squamous cell tumors tested, and in lung squamous tumors, showing low 

30 to moderate levels in three out of thirteen samples. Notably, low expression levels were 
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detected in one adenocarcinoma lung tumor sample (n=2). Contig 56 was negative for 
3/4 normal lung samples, and showed moderate expression levels in only large intestine, 
and low expression in salivary gland, soft palate, pancreas, bladder, and resting PBMC. 
Contig 58, also known as L769P, (SEQ ID NO: 150) was expressed at moderate levels 
5 in 11/17 head and neck squamous cell tumors tested and low expression in one 
additional sample. Expression in lung squamous tumors showed low to moderate levels 
in three out of thirteen samples. Contig 58 was negative for 3/4 normal lung samples, 
with one sample having low expression. Moderate expression levels in skin, large 
intestine, and resting PBMC were demonstrated, as well as low expression in salivary 

10 gland, soft palate, pancreas, and bladder. Contig 59 (SEQ ID NO: 157) was expressed in 
some head, neck, and lung squamous tumors. Low level expression of Contig 59 was 
also detected in salivary gland and large intestine. 

The fiill-length cDNA sequence for Contig 22, also referred to as L763P, 
is provided in SEQ ID NO: 158, with the corresponding amino acid sequence being 

1 5 provided in SEQ ID NO: 159. Real-time RT-PCR analysis of L763P revealed that it is 
highly expressed in 3/4 lung squamous tumors as well as 4/4 head and neck squamous 
tumors, with low level expression being observed in normal brain, skin, soft pallet and 
trachea. Subsequent database searches revealed that the sequence of SEQ ID NO: 158 
contains a mutation, resulting in a firameshifi in the corresponding protein sequence. A 

20 second cDNA sequence for L763P is provided in SEQ ID NO: 345, with the 
corresponding amino acid sequence being provided in SEQ ID NO: 346. The sequences 
of SEQ ID NO: 1 59 and 346 are identical with the exception of the C-terminal 33 amino 
acids of SEQ ID NO: 159. 

The full-length cDNA sequence incorporating Contigs 17, 19, and 24, 

25 refened to as L762P, is provided in SEQ ID NO: 160, wdth the corresponding amino 
acid sequence being provided in SEQ ID NO: 161. Further analysis of L762P has 
determined it to be a type I membrane protein and two additional variants have been 
sequenced. Variant 1 (SEQ ID NO: 167, with the corresponding amino acid sequence 
in SEQ ID NO: 169) is an alternatively spliced form of SEQ ID NO: 160 resulting in 

30 deletion of 503 nucleotides, as well as deletion of a short segment of the expressed 
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protein. Variant 2 (SEQ ID NO: 168, with the con^sponding amino add sequence in 
SEQ ID NO; 170) has a two nucleotide deletion at the 3' coding region in companson 
to SEQ ID NO: 160, resulting in a secreted fonn of the expressed protein. Real-time 
RT-PCR analysis of L762P revealed that is over-expressed in 3/4 lung squamous 
5 tumors and 4/4 head & neck tumors, with low level expression being observed in 
nonnal skin, soft pallet and trachea. 

An epitope of L762P was identified as having the sequence 
KPGHWTYTLNNTHHSLQALK (SEQ ID NO: 382), which corresponds to amino 
acids 571-590 of SEQ ID N0:161. 

fuU-length cDNA sequence for contig 56 (SEQ ID NO: 148), also 
refened to as L773P, is provided in SEQ ID NO: 171, with the amino acid sequem« in 
SEQ ID NO: 172. L773P was found to be identical to dihydroxyl dehydrogenase at the 
3' portion of the gene, with divergent 5' sequence. As aresult, the 69 N-temiinal amino 
acids are unique. The cDNA sequence encoding the 69 N-tenninal amino adds is 

1 5 provided in SEQ ID NO: 349, with the N-tenninal amino acid sequence bdng provided 
in SEQ ID NO: 350. Real-tune PGR revealed that L773P is highly expressed in lung 
squamous tumor and Imig adenocarcinoma, with no detectable expression in nonnal 
tissues. Subsequent Northern blot analysis of L773P demonstrated that this transcript is 
differentiaily over-expressed in squamous tumors and detected at approximately 1.6 Kb 

20 in primary lung tumor tissue and approxunately 1.3 Kb in primary head and neck tmnor 



tissue. 



Subsequent microanay analysis has shown Contig 58, also referred to as 
L769S (SEQ ID NO: 150), to be overexpressed in breast tumors in addition to lung 
squamous tumors. 



25 



EXAMPLE 4 

ISOLATION ATvin m AP ACTERIZATTON OF LUNG TI JM QR POLYPEPTTDFR 
BY PCR-BASE D SUBTR ACTTnTsT 
Seven hundred and sixty dones from a cDNA subtraction library, 
30 containing cDNA from a pool of two human lung primary adenocarcinomas subtracted 
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against a pool of nine normal human tissue cDNAs including skin, colon, lung, 
esophagus, brain, kidney, spleen, pancreas and liver, (Clontech, Palo Alto, CA) were 
derived and submitted to a first round of PGR amplification. This library (referred to as 
ALT-1) was subjected to a second round of PGR amplification, following the 
S manu&cturer's protocol. The expression levels of these 760 cDNA clones in limg 
tumor, normal lung, and various other normal and tumor tissues, were examined using 
microarray technology (Incyte, Palo Alto, GA). Briefly, the PGR amplification products 
were dotted onto slides in an array format, with each product occupying a unique 
location in the array. mRNA was extracted fit)m the tissue sample to be tested, reverse 

10 transcribed, and fluorescent-labeled cDNA probes were generated. The microarrays 
were probed with the labeled cDNA probes, the slides scaimed and fluorescence 
intensity was measured. This intensity correlates with the hybridization intensity.. A 
total of 118 clones, of which 55 were unique, were foimd to be over-expressed in lung 
tumor tissue, vrtth expression in normal tissues tested (limg, skin, lymph node, colon, 

15 liver, pancreas, breast, heart, bone marrow, large intestine, kidney, stomach, brain, small 
intestine, bladder and salivary gland) being either undetectable, or at significantly lower 
levels. One of these clones, having the sequence as provided in SEQ ID NO:420 (clone 
#19014), shows homology to a previously identified clone, L773P. Glone L773P has 
the fulWength cDNA sequence provided in SEQ ID NO: 171 and the amino acid 

20 sequence provided in SEQ ID NO:172 The isolation of clone #19014 is also described 
in co-pendmg U.S. Patent application 09/285,479, filed April 2, 1999. 

EXAMPLES 
SYNTHESIS OF POLYPEPTIDES 
25 Polypeptides may be synthesized on a Perkin Elmer/Applied Biosystems 

Division 430A peptide synthesizer using FMOC chemistry with HPTU (O- 
Benzotriazole-N,N,N'J^'-tetramethyluronium hexafluorophosphate) activation. A Gly- 
Gys-Gly sequence may be attached to the amino terminus of the peptide to provide a 
method of conjugation, binding to an immobilized surface, or labeling of the peptide. 
30 Gleavage of the peptides fi'om the solid support is carried out using the foUowing 
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cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole.-water:phenol 
(40:1 :2:2:3). After cleaving for 2 hours, the peptides are precipitated in cold methyl-t- 
butyl-ether. The peptide pellets are then dissolved in water containing 0.1% 
trifluoroacetic acid (TFA) and lyophilized prior to purification by C18 reverse phase 
5 HPLC. A gradient of 0%^0% acetonitrile (contaming 0.1% TFA) in water (containing 
0.1% TFA) may be used to elute the peptides. Following lyophilization of the pure 
firactions, the peptides are characterized using electrospray or other types of mass 
spectrometry and by amino acid analysis. 

^® EXAMPLE 6 

PREP ARATION OF ANTlBODmS AGAINST LTTNG CANCER ANTTPtKnts 

Polyclonal antibodies against the lung cancer antigens L514S, L528S, 
L531S, L523 and L773P (SEQ ID NO: 155, 225, 112, 176 and 171, respectively) were' 
prepared as follows. 

Rabbits were immunized with recombinant protein expressed in and 
purified fi-om E. coli as described below. For the initial immunization, 400 ^lg of 
antigen combined with muramyl dipeptide (MDP) was injected subcutanebusly (S.C.). 
Animals were boosted S.C. 4 weeks later with 200 ng of antigen mixed with incomplete 
Freund's Adjuvant (IFA). Subsequent boosts of 100 jig of antigen mixed with IFA 
20 were injected S.C. as necessary to induce high antibody titer responses. Serum bleeds 
firom immunized rabbits were tested for antigen-specific reactivity using EUSA assays 
with purified protein. Polyclonal antibodies against L5I4S, L528S, L531S, L523S and 
L773P were affinity purified fi-om high titer polyclonal sera using purified protein 
attached to a solid support. 
^ Immunohistochemical analysis using polyclonal antibodies against 

L514S was performed on a panel of 5 lung tumor samples, 5 normal lung tissue samples 
and normal colon, bdney, liver, brain and bone marrow. Specifically, tissue samples 
were fixed in formalin solution for 24 hours and embedded in paraffin before being 
sliced into 10 micron sections. Tissue sections were permeabilized and incubated with 
30 antibody for 1 hr. HRP-labeled anti-mouse foUowed by incubation with DAB 
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chromogen was used to visualize L514S immunoreactivity. L514S was found to be 
highly expressed in lung tumor tissue with little or no expression being observed in 
normal lung, brain or bone marrow. Li^t staining was observed in colon (epithelial 
crypt cells positive) and kidney (tubules positive). Staining was seen in normal liver but 
5 no mRNA has been detected in this tissue making this result suspect. 

Using the same procedure, immunohistochemical analysis using 
polyclonal antibodies against LS28S demonstrated staining in lung tumor and normal 
lung samples, light staining in colon and kidney, and no staining in liver and heart 

Immunohistochemical analysis using polyclonal antibodies against 

10 LS31S demonstrated staining in lung tumor samples, light membrane staining in most 
normal lung samples, epithelial staining in colon, tubule staining in kidney, ductal 
epithelial staining in liver and no staining in heart. 

Immunohistochemical analysis using polyclonal antibodies against 
L523S demonstrated staining in all lung cancer samples tested but no staining in normal 

15 limg, kidney, liver, colon, bone marrow or cerebellum. 

Generation of polyclonal anti-sera against L762P (SEQ ID NO: 169 and 
170) was performed as follows. 400 micrograms of Ixmg antigen was combined with 
100 micrograms of muramyldipeptide (MDP). An equal volume of Incomplete 
Freimd's Adjuvant (IFA) was added and then mixed until an emulsion was formed. 

20 Rabbits were injected subcutaneously (S.C.). After four weeks the animals were 
injected S.C. with 200 micrograms of antigen mixed with an equal volume of DFA. 
Every four weeks animals were boosted with 100 micrograms of antigen. Seven days 
following each boost the animal was bled. Sera was generated by incubating the blood 
at 4^C for 12-24 hours followed by centrifugation. 

25 Characterization of polyclonal antisera was carried out as follows. 

Ninety-six well plates were coated with antigen by incubing with 50 microliters 
(typically 1 microgram) at 4^C for 20 hrs. 250 microliters of BSA blocking buffer was 
added to the wells and incubated at room temperatiure for 2 hrs. Plates were washed 6 
times with PBS/0.01% Tween. Rabbit sera was diluted in PBSand 50 microliters of 

30 diluted sera was added to each well and incubated at room temperature for 30 min. 
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Plates were washed as described above before addition of 50 microliters of goat anti- 
rabbit horse radish peroxidase (HRP) at a 1:10000 dilution and incubation at room 
temperature for 30 min. Plates were washed as described above and lOOfJ of TMB 
Microwell Peroxidase Substrate was added to each well. Following a 15 minute 
5 incubation in tiie dark at room temperature, tiie colorimetric reaction was stopped with 
100^1 IN H2SO4 and read immediately at 450 nm. Antisera showed strong reactivity to 
antigen L762P. 

Immunohistochemical analysis using polyclonal antibodies against 
L762P demonstrated staining in all lung cancer samples tested, some light staining in 
10 the bronchiole epithelium of normal lung, tubule staining in kidney, light epitiielial 
staining in colon and no staining in heart or liver. 

In order to evaluate L773P protein expression in various tissues, 
immunohistochemistiy QUC) analysis was performed using an affinity purified L773P 
polyclonal antibody. Briefly, tissue samples were fixed in fonnalin solution for 12-24 
15 hrs and embedded in paraffin before being sliced into 8 micron sections. Steam heat 
induced epitope retiieval (SHIER) in 0.1 M sodiuym citrate buffer (pH 6.0) was used 
for optimal staming conditions. Sections were incubated wifli 10% serum/PBS for 5 
minutes. Primary antibody was added to each section for 25 minutes at indicated 
conceutiations followed by 25 minute incubation wifli either anti-rabbit or anti-mouse 
20 biotinylated antibody. Endogenous peroxidase activitiy was blocked by ti«se 1.5 minute 
incubations witii hydrogen p«-oxidase. The avidm biotin complex/horse radish 
peroxidase (ABCTHRP) system was used along with DAB chromogen to visuahze 
L773P expression. Slides were counterstainied with hematoxylin to visualize cell 
nuclei. Using tiiis approach, L773P protein was detected in 6/8 lung tumors, 4/6 normal 
25 lung samples (very light staining in some cases), l/I kidney samples (veiy light 
staining), 0/1 heart samples, 1/1 colon samples (very light staining) and 0/1 Uver 
samples. 
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EXAMPLE? 

PEPTIDE PRIMING OF MICE AND PROPAGATION OF CTL LINES 

Immunogenic peptides from the limg cancer antigen L762P (SEQ ID 
5 NO: 1 61) for HLA-A2/K**-restricted CD8+ T cells were identified as follows. 

The location of HLA-A2 binding peptides within the lung cancer antigen 
L762P (SEQ ID NO: 161) was predicted using a computer program which predicts 
peptides sequences likely to being to HLA-A*0201 by fitting to the known peptide 
binding motif for HLA-A*0201 (Rupert et al (1993) Cell 74:929; Rammensee et al 
10 (1995) Immunogenetics 47:178-228). A series of 19 synthetic peptides corresponding 
to a selected subset of the predicted HLA-A*0201 binding peptides was prepared as 
described above. 

Mice expressing the transgene for human HLA A2/K** (provided by Dr L. 
Sherman, The Scripps Research Institute, La JoUa, CA) were immunized with the 

15 synthetic peptides, as described by Theobald et al., Proc, Natl Acad, Sci. USA 
P2:11993-11997, 1995, with the following modifications. Mice were immunized with 
50}ig of L726P peptide and 120|ag of an I-A** binding peptide derived from hepatitis B 
virus protein emulsified in incomplete Freimd's adjuvant Three weeks later these mice 
were sacrificed and single cell suspensions prepared. Cells were then resuspended at 7 

20 X 10^ cells/ml in complete media (RPMI-1$40; Gibco BRL, Gaitiiersburg, MD) 
containing 10% FCS, 2mM Glutamine (Gibco BRL), sodium pyruvate (Gibco BRL), 
non-essential amino acids (Gibco BRL), 2 x 10"^ M 2-mercaptoethanol, 50U/ml 
penicillin and streptomycin, and cultured in the presence of irradiated (3000 rads) 
L762P peptide- (5[ig/ml) and lOmg/ml B2-microglobulin- (3 jig/ml) LPS blasts (A2 

25 transgenic spleens cells cultured in the presence of 7iig/ml dextran sulfate and 25ng/ml 
LPS for 3 days). After six days, cells (5 x lOVml) were restimulated with 2.5 x lOVml 
peptide-pulsed irradiated (20,000 rads) EL4A2Kb cells (Sherman et al. Science 
255:815-818, 1992) and 5 x lOVml irradiated (3000 rads) A2/K*'-transgenic spleen 
feeder cells. Cells were cultured in the presence of lOU/ml lL-2. Cells were 

30 restimulated on a weekly basis as described, in preparation for cloning the line. 
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Peptide-specific cell lines were cloned by limiting dilution analysis with 
irradiated (20,000 rads) L762P peptide-pulsed EU A2Kb tumor cells (1 x 10^ 
cells/well) as stimulators and irradiated (3000 lads) A2/K'>-transgenic spleen cells as 
feeders (5x 10^ cells/well)growninthepresenceoflOU/inlIL-2. On day 7, cells were 
5 restimulated as before. On day 14, clones that were growing were isolated and 
maintained in culture. 

Cell lines specific for the peptides L762P-87 (SEQ ID NO: 226; 
corresponding to amino acids 87-95 of SEQ ID NO: 161), L762P-145 (SEQ ID No! 
227; corresponding to amino acids 145-153 of SEQ ID NO: 161), L762P-585 (SEQ ID 
10 NO: 228; corresponding to amino acids 585-593 of SEQ ID NO: 161), L762P-425 
(SEQ ID NO: 229; corresponding to amino adds 425-433 of SEQ ID NO: 161), 
L762P(10)-424 (SEQ ID NO: 230; corresponding to amino acids 424-433 of SEQ ID 
NO: 161) and L762P(10H58 (SEQ ID NO: 231; corresponding to amino acids 458-467 
of SEQ ID NO: 161) demonstrated significantly higher reactivity (as measured by 
15 percent specific lysis) against L762P peptide-pulsed EL4-A2/K:'' tumor target cells than 
control peptide-pulsed EL4-A2/K'' tumor target cells. 



EXAMPLE 8 

IDENTMCATTON OF CTH TA41\4UNOGENTC T rPT T EPITOPPS DRRivpn ppnx>T 
THE LUNG CANCER ANTTnKN T ifnv 

CD4 T cell lines specific for the antigen L762P (SEQ ID NO: 161) were 
generated as follows. 

A series of 28 overlapping peptides were synthesized that spanned 
approximately 50% of the L762P sequence. For priming, peptides were combined into 

25 pools of 4-5 peptides, pulsed at 20 micrograms/ml into dendritic cells for 24 hours. The 
dendritic cells were then washed and mixed with positively selected CD4+ T cells in 96 
well U-bottomed plates. Forty cultures were generated for each peptide pool. Cultures 
were restimulated weekly with fresh dendritic cells loaded with peptide pools. 
Following a total of 3 stimulation cycles, cells were rested for an additional week and 

30 tested for specificity to antigen presenting cells (APC) pulsed with peptide pools using 
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interferon-gamma ELISA and proliferation assays. For these assays, adherent 
monocytes loaded with either the relevant peptide pool or an irrelevant peptide were 
used as APC. T cell lines that appeared to specifically recognize L762P peptide pools 
both by cytokine release and proliferation were identified for each pool. Emphasis was 
5 placed on identifying T cells with proliferative responses. T cell lines that demonstrated 
either both L762P-specific cytokine secretion and proliferation, or strong proliferation 
alone were further expanded to be tested for recognition of individual peptides firom the 
pools, as well as for recognition of recombinant L762P. TTie source of recombinant 
L762P was E, coli, and the material was partially purified and endotoxin positive. 

10 These studies employed 10 micrograms of individual peptides, 10 or 2 micrograms "of 
an irrelevant peptide, and 2 or 0.5 micrograms of either L762P protein or an irrelevant, 
equally impure, E, coli generated recombinant protein. Significant interferon-gamma 
production and CD4 T cell proliferation was induced by a number of L762P-derived 
peptides in each pool. The amino acid sequences for these peptides are provided in 

15 SEQ ID NO: 232-251. These peptides correspond to amino acids 661-680, 676-696, 
526-545, 874-893, 811-830, 871-891, 856-875, 826-845, 795-815, 736-755, 706-725, 
706-725, 691-710, 601-620, 571-590, 556-575, 616-635, 646-665, 631-650, 541-560 
and 586-605, respectively, of SEQ ID NO: 161. 

CD4 T ceU lines that demonstrated specificity for individual L762P- 

20 derived peptides were further expanded by stimulation with the relevant peptide at 10 
micrograms/ml. Two weeks post-stimulation, T cell lines were tested using both 
proliferation and IFN-gamma ELISA assays for recognition of the specific peptide. A 
nimiber of previously identified T cells continued to demonstrate L762P^peptide 
specific activity. Each of these lines was further expanded on the relevant peptide and, 

25 following two weeks of expansion, tested for specific recognition of the L762P-peptide 
in titration experiments, as well as for recognition of recombinant E. coli-d&nved L762P 
protein. For these experiments, autologous adherent monocytes were pulsed with either 
the relevant L762P-derived peptide, an irrelevant mammaglobin-derived peptide, 
recombinant E, coli-denvcd L762P (approx. 50% pure), or an irrelevant £ co//-derived 

30 protein. The majority of T cell lines were found to show low affinity for the relevant 
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peptide, since specific proliferation and IFN-gamma ratios dramatically decreased as 
L762P peptide was diluted. However, four lines were identified that demonstrated 
significant activity even at 0.1 micrograms/ml peptide. Each of these lines (refeired to 
as A/D5. D/F5. E/A7 and E/B6) also appeared to specifically proliferate in response to 
5 the £ co//-derived L762P protein preparation, but not in response to the irrelevant 
protein preparatioa The amino acid sequences of the L762P-deiived peptides 
recognized by these lines are provided in SEQ ID NO: 234, 249, 236 and 245, 
respectively. No protein specific IFN-gamma was detected for any of the lines. Lines 
A/D5, E/A7 and E/B6 were cloned on autologous adherent monocytes pulsed with the 
10 relevant peptide at 0.1 (A/D5 and E/A7) or 1 (D/F5) microgram/ml. Following growth, 
clones were tested for specificity for the relevant peptide. Numerous clones specific for 
the relevant peptide were identified for lines A/D5 and E/A7. 



EXAMPLE 9 

^5 P ROTEIN EXPRESSION OF T.TJNG TUMOR-SPECIFIC ANTTfiRNTS 

a) Expression of L514S in K nnli 

The lung tumor antigen L514S (SEQ ID NO: 89) was subcloned mto the 
expression vector pE32b at Ncol and NotI sites, and transformed into E. coli using 
standard techniques. The protein was expressed from residues 3-153 of SEQ ID NO: 
20 89. The expressed amino acid sequence and the conespondmg DNA sequence are 
provided in SEQ ID NO: 252 and 253, respectively. 

b) depression of L767P 

Amino acids 32-944 of the lung tumor antigen L762P (SEQ ID NO: 161), with a 6X His 
Tag. were subcloned into a modified pET28 expression vector, using kanamycin 
25 resistance, and transfoimed mto BL21 CodonPlus using standard techniques. Low to 
moderate levels of expression were observed. The determined DNA sequence of the 
L762P expression construct is provided in SEQ ID NO: 254. 



137 



wo 02/00174 



PCT/USOl/21065 



EXAMPLE 10 

IDENTIFICATION OF MHC CLASS U RESTRICTING ALLELE FOR L762P 
PEPTIDE-SPECIFIC RESPONSES 
S A panel of HLA mismatched antigen presenting cells (APC) were used 

to identify the MHC class n restricting allele for the L762P-peptide specific responses 
of CD4 T cell clones derived from lines that recognized L762P peptide and recombinant 
protein. Clones from two lines, AD-5 and EA-7, were tested as described below. The 
AD-5 derived clones were foimd to be restricted by the HLA-DRB-UOl allele, and an 

10 EA-7 derived clone was found to be restricted by the HLA DRB-0701 or DQB 1-0202 
allele. Identification of the restriction allele allows targeting of vaccine therapies using 
the defined peptide to individuals that express the relevant class 11 allele. Knowing the 
relevant restricting allele will also enable clinical monitoring for responses to the 
defined peptide since only individuals that express the relevant allele will be monitored. 

15 CD4 T cell clones derived from line AD-5 and EA-7 were stunulated on 

autologous APC pulsed with the specific peptide at 10 ^g/ml, and tested for recognition 
of autologous APC (from donor D72) as well as against a panel of APC partially 
matched with D72 at class II alleles. Table 2 shows the HLA class typing of the APC 
tested. Adherent monocytes (generated by 2 hour adherence) from four different 

20 donors, referred to as D45, D187, D208, and D326, were used as APC in these 
experiments. Autologous APC were not included in the experiment Each of the APC 
were pulsed with the relevant peptide (5a for AD-5 and 3e for 3A-7) or the irrelevant 
mammoglobin peptide at 10 jxg/ml, and cultures were established for 10,000 T cells and 
about 20,000 APC/well. As shown in Table 3, specific proliferation and cytokine 

25 production could be detected only when partially matched donor cells were used as 
APC. Based on the MHC typing analysis, these results strongly suggest that the 
restricting allele for the L762 -specific response of the AD-5 derived clones is HLA- 
DRB-1101 and for the EA-7 derived clone the restrictmg allele is HLA DRB-0701 or 
DQBl-0202. 
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Table 2 - HLA Typing of APP 



DONOR 


DR 


DR 


DQ 


DQ 


D72 


Bl-1101 


Bl-0701 


Bl-0202 


Bl-0301 


D45 


-3 


-15 


Bl-0201 


Bl-0602 


D187 


-4 


-15 


-1 


-7 


D208 


Bl-1101 


Bl-0407 


-3 


-3 


D326 


Bl-0301 


Bl-0701 


Bl-0202 


Bl-0201 
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D72 
DR-0701, 
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EXAMPLE 11 

FUSION PROTRMS OFN-TERMINAT, AXrn P-TERMTNAT. PnP noNS OF T,7fi^P 
In another embodiment, a Mycobacterium tuberculosis-dcdysd 
polynucleotide, refened to as Ral2, is linked to at least an immunogenic portion of a 
5 polynucleotide of this invention. Ral2 compositions and methods for their use in 
enhancing expression of heterologous polynucleotide sequences are described in U.S. 
Patent Application 60/158,585. the disclosure of Tvhich is incoiporated herein by 
reference in its entirely. Briefly, Ral2 refers to a polynucleotide region that is a 
subsequence of a Myce^acterium tuberculosis MTB32A nucleic acid. MTB32A is a 
10 serine protease of 32 KD molecular weight encoded by a gene in virulent and avirulent 
strains of M tuberculosis. The nucleotide sequence and amino acid sequence of 
]VfrB32A have been described (for example, US. Patent Application 60/158,585; see 
also, Skeiky et al.. Infection and Immun. (1999) 67:3998-4007. incorporated herein by 
reference). Suiprisingly, it was discovered that a 14 KD C-terminal fiagment of the 
15 MTB32A coding sequence expresses at high levels on its own and remains as a soluble 
protem throughout the purification process. Moreover, this fiagment may enhance the 
immunogenicity of heterologous antigenic polypeptides wifli which it is fiised. -nrn 14 
KD C-terminal fi^ent of the MTB32A is refened to herein as Ral2 and represents a 
fi-agment comprising some or all of ammo acid residues 192 to 323 of MTB32A. 
20 Recombinant nucleic adds which encode a fiision polypeptide 

comprising a Ral2 polypeptide and a heterologous lung tumor polypeptide of interest, 
can be readily constructed by conventional genetic engineering techniques. 
Recombinant nucleic adds are constructed so that, preferably, a Ral2 polynucleotide 
sequence is located 5' to a selected heterologous lung tumor polynucleotide sequence. 
25 It may also be appropriate to place a Ral2 polynucleotide sequence 3' to a selected 
heterologous polynucleotide sequence or to insert a heterologous polynucleotide 
sequence mto a site within a Ral2 polynucleotide sequence. 

In addition, any suitable polynucleotide tiiat encodes a Ral2 or a portion 
or other variant tiiereof can be used in constructing recombinant fijsion polynucleotides 
30 comprising Ral2 and one or more lung tumor polynucleotides disclosed herein. 
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Preferred Ral2 polynucleotides generally comprise at least about 15 consecutive 
nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, at least about 
100 nucleotides, at least about 200 nucleotides, or at least about 300 nucleotides that 
encode a portion of a Ral2 polypeptide. 
5 Ral2 polynucleotides may comprise a native sequence (ie,, an 

endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 
comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
activity of the encoded fusion polypeptide is not substantially diminished, relative to a 

1 0 fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 
preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral2 polypq)tide or a portion thereof. 

Two specific embodiments of fusions between Ral2 and antigens of the 

1 5 presmt invention are described in this example. 

A. N-Terminal Portion of L763P 

A fusion protein of full-length Ral2 and the N-terminal portion of L763P (referred to as 
L763P-N; amino acid residues 1-130 of SEQ ID NO: 159) was expressed as a single 
recombinant protein in E. coli. The cDNA for the N-terminal portion was obtained by 

20 PGR with a cDNA for the full length L763P and primers L763F3 (5' 
CGGCGAATTCATGGATTGGGGGACGCTGC; SEQ ID NO: 383) and 1763RV3 (5' 
CGGCCTCGAGTCACCCCTCTATCCGAACCTTCTGC; SEQ ID NO: 384). The 
PGR product with expected size was recovered from agarose gel, digested with 
restriction enzymes EcoRI and Xhol, and cloned into the corresponding sites in the 

25 expression vector pGRXl. The sequence for the fusion of full-length of Ral2 and 
L763P-N was confirmed by DNA sequencing. The determined cDNA sequence is 
provided in SEQ ID NO:351, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 352). 
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C-Tenninal Portion of T.7fi^P 

A fusion protein of fofl-length Ral2 and the C-teiminal portion of L763P 
(referred to as L763P-C; amino add residues 100-262 of SEQ ID NO: 159) was 
expressed as a single recombinant protein in R colL The cDNA of the Cterminal 
5 portion of L763P was obtained by PGR with a cDNA for the full length of L763P and 
primers L763F4 (5' CGGGGAATTGGAGGAAGGAGTGGGAAGTTGAG; SEQ ID 
NO: 385) and L763RV4 (5' CGGCTCGAG-TTAGCTTGGGCCTGTGArrGC; SbQ 
ID NO: 386). The PGR product with expected size was recovered from agarose gel 
digested with restriction enzymes EcoRI and Xhol, and cloned into the corresponding 
10 sites in the expression vector pCRXl. The sequence for the fusion of full-length Ral2 
and L763P-C was confirmed by DNA sequencing. The determined DNA sequence is 
provided in SEQ ID NO:353, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 354. 

nie recombinant proteins described in this example are useful for the 
15 pr^aration of vaccines, for antibody therapeutics, and for diagm,sis of lung tumors. 



EXAMPLE 12 

EXPRESSION TN F. COUQV MfOP tag FT JSTON pphtptxt 

20 PGR was performed on the L762P coding region with the following 

primers: 

Forward primer starting at amino add 32. 

PDM-278 5'ggagtacagcttcaagacaatggg 3' (SEQ ID N0:355) Tm 57°C. 
Reverse primer including natural stop codon after amino add 920. 
25 creating EcoRI site 

PDM.280 5'ccatgggaattcattataataattttgttcc 3' (SEQ ID NO-356^ 
TM55"G. ■ ' 

The PGR product was digested with EcoRI restriction enzyme gel 
purified and then cloned into pPDM His, a modified pEl^S vector with a His tag in 
30 fi^«. whidr had been digested with EC072I and EcoRI restriction errzymes. Tie 
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correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 (DE3) pLys S and BL21 (DE3) CodonPlus RIL expression hosts. 

The protein sequence of expressed recombinant L762P is shown in SEQ 
ID NO:357, and the DNA sequence is shown in SEQ ID NO:358. 

5 

EXAMPLE 13 

EXPRESSION IN E. COLI OF A L773PA HIS TAG FUSION PROTEIN 

The L773PA coding region (encoding amino acids 2-71 of SEQ ID NO: 
10 172) was PGR amplified using the following primers: 

Forward primer for L773PA starting at amino acid 2: 

PDM-299 5'tggcagcccctcttcttcaagtggc 3' (SEQ ID NO:359) Tm63*'C. 

Reverse primer for L773PA creating artificial stop codon after amino 

acid 70: 

15 PDM-355 5'cgccagaattcatcaaacaaatctgttagcacc3*(SEQIDNO:360) 

Tm62**G. 

The resulting PGR product was digested with EcoRI restriction enzyme, 
gel purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
fiBme, which had been digested with Eco72I and EcoRI restriction enzymes. The 
20 correct construct was confirmed by DNA sequence analysis and transformed into BL21 
(DE3) pLys S and BL21 (DE3) CodonPlus RIL expression hosts. 

The protein sequence of expressed recombinant L773PA is shown in 
SEQ ID NO:361, and tlie DNA sequence is shown in SEQ ID NO:362. 

25 

EXAMPLE 14 

IDENTIFIGATION OF EPITOPES DERIVED FROM LUNG TUMOR SPECIFIC 

POLYPEPTIDES 

A series of peptides fiom the L773P amino acid sequence (SEQ ID NO: 
30 172) were synthesized and used in in viti'o priming experiments to generate peptide- 
specific GD4 T cells. These peptides were 20-mers that overlapped by 15 amino acids 
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and corresponded to amino acids 1-69 of the L773P p«>tein. This region has been 
demonstrated to be tumor-specific. Following three in vitro stimulations, CD4 T cell 
Imes were identified that produced IFNy in response to the stimulating peptide but not 
thecontrolpeptide. Some of these T cell lines demonsti^ed recognition of recombinant 
5 L773PandL773PA(tumor-specificregion)proteins. 

To perform tiie experimeats, a total of eleven 20-mer peptides (SEQ ID 
NOs: 363. 365 and 387-395) overlapping by 15 amino adds and derived from ti.e N- 
termmal tumor-spedfic region of L773P (corresponding to amino acids 1-69 of SEQ ID 
NO:172) we.e generated by standard procedures. Dendritic cells were derived from 
PBMCofanormaldonorusingGMCSFandII.4bystandardpn,toco]. PurifiedCD4T 
cells were genemted fiom the same donor as tire dendritic cells using MACS beads and 

negative selection of PBMCs.Dendriticcells were pulsedovemightwiti. the individual 
20-mer peptides at a concentration of 1 0 ^g/ml. Pulsed dendritic cells were washed and 
plated at^l x lOWlI of a 96-well U-bottom plates, and purified CD4 cells were added 
at 1 X 10 well. Cultures were supplemented with 10 ng/ml IL-6 and 5 ng/ml IL-12 and 
incubated at 37«>C. Cultures were re-stimulated as above on a weekly basis usirig as 
APC dendntic cells generated and pulsed as above, supplemented wifli 5 ng/ml IL-7 and 
1 0 f.g/ml IL-2. Following 3 in vitro stimulation cycles, cell lines (each corresponding to 
one well) were tested for cytokine production in r^nse to tire stimulating peptide vs 
20 an irrelevant peptide. 

A small number of individual CD4 T cell lines (9/528) demonstrated 

cytokine release (lFNr)in response to tire stimulatingpeptide but not to control peptide 
The CD4 T cell lines tirat demonstrated specific activity were restimulated on tire 
appropriate L773P peptide and reassayed using autologous dendritic ceUs pulsed witir 
25 10 ^g/ml of tire appropriate L773P peptide, an irrelevant control peptide, r^mbinant 
L773P protein (amino adds 2-364, made in£ co/0, recombinant L773PA (amino acids 
2-71, made m E. coli), or an appropriate control protdn (L3E, made in E. coli) Three 
of tire nine tines tested (1-3C, 1-6G, and 4-12B) recognized the appropriate L773P 
peptide as well as recombinant L773P and L773PA. Four of the lines tested (4-8A, 4- 
8E, 4-12D, and 4-12E) r^gnized tire appropriate L773P peptide only. Two of tire 
lures tested (5-6F and 9-3B) demonstrated non-spedfic activity 
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These results demonstrate that the pq)tide sequences 
MWQPLFFKWLLSCCPGSSQl (amino acids 1-20 of SEQ ID NO: 172; SEQ ID 
NO:363) and GSSQIAAAASTQPEDDINTQ (amino acids 16-35 of SEQ ID NO: 172; 
SEQ ID NO: 365) may represent naturally processed epitopes of L773P, which are 
5 capable of stimulating human class II MHC-restricted CD4 T cell responses. 

In subsequent studies, the above epitope mapping experiment was 
repeated using a different donor. Again, some of the resulting T cell lines were foimd to 
respond to peptide and recombinant protein. An additional peptide was found to be 
naturally processed. Specifically, purified CD4 cells were stimulated on a total of 

10 eleven 20-mer peptides overlapping by 1 5 amino acids (SEQ ID NO: 363, 387, 388, 365 
and 389-395, respectively). The priming was carried out as described above, except that 
a peptide concentration of 0.5 ug/mL rather than 10 ug/mL was employed. In the initial 
screen of the cell Imes 9 of the 528 lines released at least a three-fold greater level of 
BFN-gamma with stimulating peptide vs. control peptide. These 9 lines were 

15 restimulated on the appropriate peptide and then tested on dendritic cells pulsed with a 
titration of appropriate peptide (10 ug/mL, 1 ug/mL and 0.1 ug/mL), and 10 ug/mL of a 
control peptide. Six of the 9 lines recognized recombinant L773P as well as peptide. 
The six lines referred to as 1-lE, 1-2E, 1-4H, 1-6A, 1-6G and 2-1 2B recognized 
L773PA and the appropriate peptide. These results demonstrate that the peptides of 

20 SEQ ID NO: 363 and 387 represent naturally processed epitopes of L773P. 

Using the procedures described above, CD4+ T cell responses were 
generated fiom PBMC of normal donors using dendritic cells pulsed with overlapping 
20-mer peptides (SEQ ID NO: 396-419) spanning the L523S polypeptide sequence 
(SEQ ID NO: 176). A number of CD4+ T cells demonstrated reactivity with the 

25 priming peptides as well as with L523S recombinant protein, with the dominant 
reactivity of these lines being within the peptides 4, 7 and 21 (SEQ ID NO: 399, 402 
and 416; corresponding to amino acids 30-39, 60-79 and 200-219, respectively, of SEQ 
ID NO: 176). 

Epitopes within the scope of the invention include epitopes restricted by 
30 other class II MHC molecules. In addition, variants of the peptide can be produced 
wherein one or more amino acids are altered such that there is no effect on the ability of 
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the peptides to bind to MHC molecules, no effect on their ability to eUcit T cell 
responses, and no effect on the ability of the elicited T cells to recognize recombinant 
protein. 



EXAMPLE 15 
SURFACE KVPRESSION OF T .7fi9P a>tp 
ANTIBODY EPITOPES THFPPni? 
Rabbits were immunized with full-length histidine-tagged L762P protein 
10 generated in E. colL Sera was isolated from rabbits and screened for specific 
recognition of L762P in EUSA assays. One polyclonal serum, referred to as 2692L, was 
identified that specifically recognized recombinant L762P protein. The 2692L anti- 
L762P polyclonal antibodies were purified from the serum by affinity purification using 
L762P affinity columns. Although L762P is expressed in a subset of primary lung tumor 
15 samples, expression appears to be lost in established lung tumor cell lines. Woi«, 
to characterize surface expression of L762P, a retrovirus construct that expresses L762P 
was used to transduce primary human fibroblasts as well as 3 lung tumor cell lines (522- 
23, HTB. and 343T). Transduced lines were selected and expanded to examine L762P 
surface expression by FACS analysis. For this analysis, non-transduced and transduced 
20 cells were harvested using cell dissociation medium, and incubated with 10-50 
micrograms/ml of either affinity purified anti-L762P or irrelevant antisera. FoUowing a 
30 minute incubation on ice, cells were washed and incubated with a secondary, mc 
conjugated, anti rabbit IgG antibody as above. Cells were washed, resuspended m buffer 
with Propidium Iodide (PI) and examined by FACS using an Excalibur fluorescence 
25 activated cell sorter. For FACS analysis. Pl-positive (i.e. dead/permeabilized cells) were 
excluded. The polyclonal anti-L762P sera specifically recognized and bound to the 
surface of L762P-transduced cells but not the non-transduced counterparts. These 
results demonstrate that L762P is locahzed to the ceU surfece of both fibroblasts as well 
as lung tumor cells. 

To identify the peptide epitopes recognized by 2692L, an epitope 
mapping approach was pursued. A series of overlapping 19-21 mers (5 amino acid 
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overlap) was synthesized that spanned the C terminal portion of L762P (amino acids 
481-894 of SEQ ID NO: 161). In an initial experiment peptides were tested in pools. 
Specific reactivity with the L762P antiserum was observed with pools A, B, C, and E. 
To identify the specific peptides recognized by die antiserum, flat bottom 96 well 

5 microtiter plates were coated with individual peptides at 1 0 microgram/ml for 2 hours at 
37 ^C. Wells were then aspirated and blocked with phosphate buifered saline containing 
5% (w/v) milk for 2 hours at 37 ^C, and subsequently washed in PBS containing 0.1% 
Tween 20 (PBST). Purified rabbit anti-L762P serum 2692L was added at 200 or 20 
ng/well to triplicate wells in PBST and incubated overnight at room temperature. This 

10 was followed by washing 6 times with PBST and subsequently incubating with HRP- 
conjugated donkey anti rabbit IgG (H+L)AfBnipure F(ab*) fi:agment at 1:2,000 for 60 
mmutes. Plates were then washed, and incubated in tetramethyl benzidine substrate. 
Reactions were stopped by the addition of IN sulfimc acid and plates were read at 
450/570 nm using an ELISA plate reader. 

15 The resulting data, presented in Table 4 below, demonstrates that the 

L762P antisera recognized at least 6 distinct peptide epitopes from the 3' half of L762P. 
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Table 4 



Peptide (starting 
amino acid of L762P) 



A (481).,, 



B (495) 



4 : C(5U) 



D(S26) 



E (541) 



F(556) 



G(571) 



poo] 



A 



H(586) 



1(601) . 



J (616) 



K(631) 



•:g(646)J 



M(661) 



N(676) 



l(691>- 



P(706) 
Q(72]) 



R(736) 



S(751) 



U (781) 
V(795) 



X (826) 



A 
A 



B 



_3[ 

B 



B 
C 



c 

E 



C 
D 



Y(841) 



Z(856) 



AA (871) 



BB(874):; 
No peptide 



F 
D 



D 
D 



ELISA activity (OP 4.srLj;7n 



200 ng 
polyclonal serum 



ii76. 



0.14 



0.11 



0.11 



0.04 



0.06 



0.1 



Oi'25 



0.1 



0.1 



oar 



0.14 



0.12 



%,1 



0.1 



0.11 



0.12 



0.15 



0.12 



0.07 



0.1 



0.17 



0.16 
0.17 



0,14; 



0.15 



20 ng 
polyclonal serum 



f.O 



.06 



0,18 



0.09 



0.04 



0.02 



0.02 



0.03 



0.03 



0.08 



«12 If-j,. 



0.03 



0.1 



0.03 



0.05 



0.04 



0.06 



0.06 



0.05 



0.03 



0.08 



0.05 



QMS 



Individual peptides were identified from each of the pools, and 
additionally a weak ^activity was identified with peptide BB from pool R The relevant 
peptide epitopes are summarized in the Table 5 below Ihe amino acid ^quences for 
peptides BB. O, L, I, A and C are provided in SEQ ID NO: 376-381, respectively, with 
the corresponding cDNA sequences being provided in SEQ ID NO: 373, 370, 372, 374, 
371 and 375, respectively. 
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Tables 



ELISA activity 
fOD 450-570) 



Peptide 


Nucleotides 
ofL762P 


Amino 
acids of 
L762P 


Sequence 


pool 


200 ng 


20 ng 


A 


1441-1500 


481-500 


SRISSGTGDIFQQHIQLEST 


A 


1.76 


1.0 


C 


153M590 


511-530 


KNTVTVDNTVGNDTMFLVTW 


E 


0.47 


0.18 


I 


1801-1860 


601-620 


AVPPATVEAFVERDSLHFPH 


B 


0.25 


0.06 


L 


1936-1955 


646-665 


PETGDPVTLRLLDDGAGADV 


B 


0.28 


0.12 


O 


2071-2130 


691-710 


VNHSPSISTPAHSIPGSHAMIL 


C 


1.1 


0^3 


BB 


2620-2679 


874-893 


LQSAVSNIAQAPLFIPPNSD 


F 


0.14 


0.11 


None 










0.15 


0.05 



EXAMPLE 16 

5 DETECTION OF ANTIBODIES AGAINST LUNG TUMOR ANTIGENS 

IN PATIENT SERA 

Antibodies specific for the lung tumor antigens L773PA (SEQ ID 
NO:361), L514S (SEQ ID n6:155 and \56\ L523S (SEQ ID NO:176X L762P (SEQ ID 

10 N0:161) and L763P (SEQ ID NO:159) were shown to be present in effusion fluid or 
sera of lung cancer patients but not in normal donors. More specifically, the presence of 
antibodies against L773PA, L514S, L523S, L762P and L763P in effusion fluid obtained 
from lung cancer patients and in sera from normal donors was detected by ELISA using 
recombinant proteins and HRP-conjugated anti-human Ig. Briefly, each protein (1.00 

15 ng) was coated in 96-well plate at pH 9.5. In parallel, BSA (bovine serum albumin) was 
also coated as a control protein. The signals ([S], absorbance measured at 405 nm) 
against BSA ([N]) were determined. The results of these studies are shown in Table 6, 
wherein - represents [S]/[N] < 2; +/- represents [S]/[N] >2; ++ represents [S]/[N] >3; 
and -H-+ represents [S]/[N] >5. 

20 
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Table 6 - Detection of Antihnd;.. . f,^-^ Lung T..mnr Ar.fi^. 




Using Western blot analyses, antibodies against L523S were found to be 
present in 3 out of 4 san^ples of eflusion fluid from lung cancer patients, witi, no L523S 
antibodies being detected in the three samples of nonnal sera tested. 



EXAMPLE 17 

mm siO^mK COZ/OF _AL514SHTSTAamT.Tn.T^^^ 

PGR was perfonned on the L514S,13160 coding region with the 
following primers: 
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Forward primer PDM-278 5' cacactagtgtccgcgtggcggcctac 3' (SEQ ID 
NO:421)Tm 67*^0. 

Reverse primer PDM-280 5' catgagaattcatcacatgcccttgaaggctccc 3* 
(SEQ ID NO:422) TM 66°C. 



50r|g DNA 

96°C for 2 minutes, 96°C for 20 seconds, 66^C for 15 seconds, 72°C for 
1 minute with 40 cycles and then 72°C for 4 minutes. 

The PGR product was digested with EcoRI restriction enzyme, gel 
15 purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
fimne, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 CodonPlus (Stratagene, La Jolla, CA) cells for expression. 

The amino acid sequence of expressed recombinant L514S is shown in 
20 SEQ ID NO:423, and the DNA coding region sequence is shown in SEQ ID NO:424. 



5 



The PGR conditions were as follows: 



10 



lOfil lOXPfu buffer 
l.OiillQmMdNTPs 
2.0^1 lOfiM each primer 
83fil sterile water 

1 .5\xl Pfu DNA polymerase (Stratagene, La Jolla, CA) 



EXAMPLE 18 



EXPRESSION IN E, COLIOV A L523S HIS TAG FUSION PROTEIN 



25 



PGR was performed on the L523S coding region with the following 



primers: 



Forward primer PDM-414 5' aacaaactgtatatcggaaacctcagcgagaa 3' (SiEQ 



IDNO:425)Tm62^G. 



Reverse primer PDM-415 5' ccatagaattcattacttccgtcttgactgagg 3* (SEQ 



30 IDNO:426)TM62**G. 
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The PGR conditions were as follows: 
lOjiil 1 OX Pft buffer 
l Ofd lOmMdNTPs 
2.0^1 lOfiM each primer 
83jjil sterile water 

1 .5^1 Pfu DNA polymerase (Stratagene. La JoUa, CA) 
50tigDNA 

960C for 2 minutes, 960C for 20 seconds. 62«'C for 1 5 seconds, 72»C for 
4 mmutes with 40 cycles and then 72°C for 4 minutes. 

''^^ digested with EcoRI restriction enzyme, gel 

Punfied and then cloned into pPDM His. a modified vector wi«, a His tag in 

^e, wh^ch had been digested with Eco721 and EcoRI restriction enz^es I.e 

riTci:^' r ^^""""^ --donned into 

BL21CodonPlus(Stratagene.LaJolIa,CA)cells for expression. 

SEO ID NO rr °' "^^"''^-^ ^^^BS is shown in 

SEQIDNO:427.andtheDNAcodingregionsequenceisshowninSEQIDNO:428. 



20 



EXAMPLE 19 

^5 NO:355,T»,5rT' """" ^ 3' (SEQ U> 

n>N0:4a„™;r 3. (SEQ 

The PGR conditions were as follows: ' 
10^11 OX Pfii buffer 

30 

1.0^1 lOmMdNTPs 
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2.0jil 10)iM each primer 
83 1^1 sterile water 

1 .5|xl Pfu DNA polymerase (Stratagene, La JoUa, CA) 
50r|gDNA 

5 96°C for 2 minutes, 96°C for 20 seconds, 55^C for 15 seconds, 72°C for 

- 5 minutes with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
firame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
10 correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 pLys S (Novagen, Madison, WI) cells for expression. 

The amino acid sequence of expressed recombinant L762PA is shown in 
SEQ ID NO:430, and the DNA coding region sequence is shown in SEQ ID NO:43 1 . 

15 

EXAMPLE 20 

EXPRESSION IN K COLIOF A L773P HIS TAG FUSION PROTEIN 

PCR was performed on the L773P coding region with the following 

primers: 

20 Forward primer PDM-299 5' tggcagcccctcttcttcaagtggc 3' (SEQ ID 

NO:359)Tm63*^C. 

Reverse primer PDM-300 5' cgcctgctcgagtcattaatattcatcagaaaatgg 3' 
(SEQ ID NO:432) TM 63^C. 

The PCR conditions were as follows: 
25 lO^ill OX Pfu buffer 

l.OjLil lOmMdNTPs 
2.0|al 1 0|jiM each primer 
83)Ld sterile water 

1 .5|il Pfii DNA polymerase (Stratagene, La JoUa, CA) 
30 50iigDNA 
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96°C for 2 minutes, 96»C for 20 seconds, 63<>C for 15 seconds, 72°C for 
2 minutes 1 5 seconds with 40 cycles and then 72»C for 4 minutes. 

n»e PCR product Avas digested with EcoRI restriction enzyme gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tig i„ 

5 which had been digested with EC072I and ECORI restriction enzymes TT,e 

^rrect construct was confirmed by DNA sequence analysis and then transfo^ed into 
BL2 1 pLys S (Novagen, Madison, WI) and BL2 1 CodonPlus (Strntagene, La Jolla, CA) 
cells for expression. 

Hie amino acid sequence of expressed recombinant L773P is shown in 
10 SEQIDNO:433,anddaeDNAcodingregionsequenceisshowninSEQIDNO:434. 



15 



EXAMPLE 21 

CLONING AND SROTI^QjGQFATVm^CEPTOR Pn. xu. 

LUNG SPRCIFir ANTTOKN T 7/;op 
T cell receptor (TCR) alpha and beta chains fiom a CD4 T cell clone 
specific for the lung specific antigen L762P were cloned and sequence. Basically total 
from 2 X 1 0^ ceUs from CIL clone 4H6 was isolated using Trizol reagent and 

^NAwassynthesizedusmgReady-togokits(Pha™acia).TodetennineV^^^^ 
Vbeta sequences of this clone, a panel of Valpha and Vbeta subtype sp^ific primers 
was synthesized and used inRT-PCR reactions with cDNA generated fiom each of the 
clones. T„e RT-PCR reactions demonstrat«l that each of the c,on« expressed a 
common Vbeta sequence that corresponded to the Vbetag subfamily and a Valpha 
sequencethatcorxespondedtotheValphagsubfamily. To clone the full TCR alpha and 
beta chains fi.m clone 4H6, primers were designed that spanned the initiator and 
termmator-coding TCR nucleotides. The primers were as follows: 

forward prfmer for TCR ValphaS 5' 

ggatccgccgccaccatgacatccattcgagctgta 3' (SEQ ID NO:435; has a BamHI site 
inserted); 
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Kozak reverse primer for TCR ValphaS (antisense) 5' 
gtcgactcagctggaccacagccgcag 3' (SEQ ID NO:436; has a Sail site iiiserted plus 
the TCR alpha constant sequence); 

forward primer for TCR VbetaS (sense) 5' 
5 ggatccgccgccaccatggactcctggaccttctgct 3' (SEQ ID NO:437; has a BamHI site 

inserted); and 

Kozak reverse prinaer for TCR Vbeta 5' gtcgactcagaaatcctttctcttgac 3' 
(SEQ ID NO:438; has a Sail site inserted plus the TCR beta constant sequence). 
Standard 35 cycle RT-PCR reactions were established using the cDNA synthesized 

10 from the CTL clone and the above primers utilizing the proofreading thermostable 
polymerase, PWO (Roche). The resultant PGR band, about 850 bp for Valpha and 
about 950 for Vbeta, was ligated into a PGR blunt vector (Invitrogen) and transformed 
into E. colt. E. coli transformed with plasmids having full-length alpha and beta chains 
were identified.. Large scale preparations of the corresponding plasmids were 

15 generated, and these plasmids were sequenced. The Valpha sequence (SEQ ID 
NO:439) was shown by nucleotide sequence alignment to be homologous to ValphaS. 1, 
while the Vbeta sequence (SEQ ID NO:440) was shown by nucleotide sequence 
alignment to be homologous to Vbeta82. 

20 

EXAMPLE 22 

RECOMBINANT EXPRESSION OF FULL LENGTH L762P IN MAMMALAIN 

CELLS 

Full length L762P cDNA was subcloned into the mammalian expression 
25 vectors VR1012 and pCEP4 (Invitrogen). Both expression vectors had previously been 
modified to contain a FLAG epitope tag. These constructs were transfected into 
HEK293 and CHL-1 cells (ATCC) using Lipofectamine 2000 reagent (Gibco). Briefly, 
both the HEK and CHL-1 cells were plated at a density of 100,000 cells/ml in DMEM 
(Gibco) containing 10% FBS (Hyclone) and grown overnight. The following day, 4^1 
30 of Lipofectamme 2000 was added to 1 00^1 of DMEM containing no FBS and incubated 
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for 5 minutes at room temperature. The lipofectamine/DMEM mixture was then added 
to mg of L762P FIag/pCEP4 or L762P FlagA^l012 plasmid DNA resuspended in 
lOOjd DMEM and incubated for 15 minutes at room temperature TT,e 
Lipofectamine/DNA mix was then added to the HEK293 and CHL-1 cells and 
5 mcubated for 48-72 hours at 37«>C with m CO,. Cells were rinsed with PBS then 
conected and pelleted by centrifugation. L672P expression was detected il- the 
transfected HEK293 and CHL-1 cell lysates by Western blot analysis and was detected 
on the surface of transfected HEK cells by flow cytometry analysis. 

For Western blot analysis, whole cell lysates were generated by 
10 mcubating the cells in Triton-XlOO containing lysis buffer for 30 minutes on ice 
Lysates were then cleared by centrifugation at 10,000 rpm for 5 minutes at 40C 
Samples were diluted with SDS-PAGE loading buffer contaim'ng beta-mercaptoethanol 
then borled for 10 minutes prior to loading the SDS-PAGE gel. Ite protein was' 
transfer!^ to mtroceUulose and probed using 1 ^ml purified anti.L762P rabbit 
15 polyclonal sera (lot #690/73) or non-diluted anti-L762P mAb 153.20.1 supernatant 
Blots were revealed using either goat anti-rabbit Ig coupled to HRP or goat anti-mouse 
Ig coupled to HRP followed by incubation in ECL substrate. 

For flow cytometric analysis, cells were washed further with ice cold 
staining buffer (PBSH-I0/.BSA .Azide). Next, the cells were incubated for 30 minutes 
20 on ice with lOug^ of purified a„ti.L762P polyclonal sera (lot #690/73) or a 12 
dilution of anti-L762P mAb 153.20.1 sup^natant. The cells were washed 3 times with 
^^ammg buffer and then incubated with a 1:100 dilution of goat anti-rabbit Ig(H+L> 
FITC or goat anti-mouse Ig(HfL>FITC reagent (Southern Biotechnology) for 30 
minutesonice. After3 washes, the cells were resuspended m staining buffer containbg 

25 P™Pidiunriodide(PI),avitalstainthatallowsfortheexclusionofpemaeableceU^^ 
analyzed by flow cytometry. 
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EXAMPLE 23 

GENERATION OF POLYCLONAL ANTBODIES TO LUNG TUMOR ANTIGENS 
Three lung antigens, L523S (SEQ ID NO:176), L763P (SEQ ID NO:159) 
and L763 peptide #2684 (SEQ ID NO:441), were expressed and purified for use in 
5 antibody generation. 

L523S and L763P were expressed in an £: coU recombinant expression 
system and grown overnight in LB Broth with the appropriate antibiotics at 37°C in a 
shaking incubator. The next morning, 10 ml of the overnight culture was added to 500 
ml of 2x YT with the appropriate antibiotics in a 2L-baf0[ed Erlenmeyer flask. When 
10 the optical density of the culture reached 0,4-0.6 at 560 nanometers, the cells were 
induced with IPTG (1 mM). Four hours after induction with IPTG, the cells were 
harvested by centrifiigation. 

The cells were then washed with phosphate buffered saline and 
centrifuged again. The supernatant was discarded and the cells were either frozen for 
1 5 future use or immediately processed. Twenty milliliters of lysis buffer was added to the 
cell pellets and vortexed. To break open the £1 coli cells, this mixture was then run 
througli a french press at a pressure of 16,000 psi. The cells were then centrifuged again 
and the supernatant and pellet were checked by SDS-PAGE for the partitioning of the 
recombinant protein. 

20 For proteins that localized to the cell pellet, the pellet was resuspended in 

10 mM Tris pH 8.0, 1% CHAPS and the inclusion body pellet was washed and 
centrifuged again. This procedure was repeated twice more. The washed inclusion body 
pellet was solubilized with either 8M urea or 6M guanidine HCl containing 10 mM Tris 
pH 8.0 plus 10 mM imidazole. The solubilized protein was added to 5 ml of nickel- 

25 chelate resin (Qiagen) and incubated for 45 minutes to 1 ho\ur at room temperature with 
continuous agitation. 

After incubation, the resin and protein mixture was poured through a 
disposable column and the flow through was collected. The column was then washed 
with 10-20 column volumes of the solubilization buffer. The antigen was then eluted 

30 from the column using 8M urea, 10 mM Tris pH 8.0 and 300 mM iihidazole and 
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collected in 3 ml fractions. A SD^-PAnc ^^i . ^ . 

ns. A i>DS.PAGE gel was run to detennine which fractions to 
pool for further purification. 

M.Bmt pudficadon step, a stong ^ exchange .esto. i„ ftis case 
ft-I^ Q CBioradX waa e<paiib«ed wi«, to app,„pHa.e buffer and fte pooled 
5 fi^^ona ScM above „e,e loaded onto to col^nn. Each andgen ™ el«ed offto 
column wia, an inc^aaing sal, g«die„t j^,^ „^ ^„^,^ as to col™, «aa 

™ a.^, anotor SDS-PAOB gel ™ ™n . de.e™ which iacUons fta to col»,n 

to pooL 

^'"^"'^dialyKd against 10 mMTmpH 8.0 Jle 
Jlease criteria ^ ^ de,™ed by SDS-PAGE or HPLC, conc«,«on as 
*ten»n«l by Uwy assay or Amino Acid Analy^s. ,d«i,y dettnnined by annno 
.«™,nal protein sequence, and endotoxin level was detennined by to Ltouh» (LAL) 
assay^ The p,.teina were then put in vials ate ffl,r«ion 4„ugh a 0.22-n„cron filter 
and to antigens were ftozen until needed for immmfzation. 

'5 '^''•'"•^P"* sy.«besized and conjugated .OKLH and 

lioze until needed for immunization. 

lha polyclonal a«ise>a w«e gene«ted using 400 micmgrama of each 
tog anng«, combined with 100 unc.^^ of muramyldipepdde (MDP). An equal 
vohmte of Incomplete Frennd^s Adjuvant (IPA) was added and then mixed »d injected 
subcutane„usly(S.C.)mtearabbi. A«er f™, ^ to mbbi. was S.C. boosted with 
20O m.cn>grama of amigen mixed with an equal volume of IFA. TheteaHer the tabbit 
^ ..V. boosted with ,00 micros of andgea The ™nal was bled seven days 
Mowmg each boost The blood was ton incubated a, 4»C for 12-24 horn, follow^ 
by cenlriSigation to genaate to sera. 

25 ""^ 'l«m«W«d using 96 well plates coated 

wtd. amigen and incubated with 50 microliters (typically. n,icrog,am/micmhter)„, 
pob.lon.1 and^ „ « for 20 hours. Basically, 250 mic^iite. of BSA blocUng 
buar was «Med to to wells and incubated at room f„ , ^ p J 

30 '"'"'^''^''^'''^^O-^''''^^^ Th.rabb,-.sen.„e,edilu,edmPBS/0.m 
30 Twee.,0.1.^^. 5« microliters of diluted sera was added te each weB and i„cub«ed 
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at room temperature for 30 minutes. The plates were washed as described above, and 
then 50 microliters of goat anti-rabbit horseradish peroxidase (HRP) at a 1:10000 
dilution was added and incubated at room temperature for 30 minutes. 

The plates were washed as described above, and 100 microliters of TMB 
5 Microwell Peroxidase Substrate was added to each well. Following a 15-minute 
incubation in the dark at room temperature, the colorimetric reaction was stopped with 
100 microliters of IN H2SO4 and read immediately at 450 nm. All the polyclonal 
antibodies showed inMnimoreactivity to the appropriate antigenTables 7-9 show the 
antibody reactivity of rabbit antisera in serial dilution to the three lung antigens, L523S, 

10 L763P and L763 peptide #2684. The first column shows the antibody dilutions. The 
columns "Pre-immune sera" indicate ELISA data for two experiments using pre- 
immune sera. These results are averaged in the fourth column. The columns "anti- 
L523S, L763P or #2684" indicate ELISA data for two experiments using sera from 
rabbits immunized as described in this Example, using the respective antigen, referred 

15 to as either L523S, L763P or #2684 in the tables. 
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Table 7 




Table 8 
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Table 9 



Antibody 
dilution 


Pre- 
uninunv 

sera (1) 


rre- 

IlIUlIUIlC 

sera (2) 


Average 


Anti- 

(1) 


Anti- 
(2) 


Average 


1 


1000 


0.07 


0.07 


0.07 


2.10 


2.00 


2.05 


1 


.2000 


0.07 


0.06 


0.06 


1.95 


1.96 


1.95 


1 


:4000 


0.06 


0.06 


0.06 


1.77 


1.82 


1.79 


li 


iSOOO 


0.06 


0.06 


0.06 


1.79 


1.81 


1.80 


1 


:16000 


0.06 


0.06 


0.06 


1.54 


1.50 


1.52 


1 


:32000 


0.06 


0.06 


0.06 


1.27 


1.20 


1.24 


1 


64000 


0.06 


0.06 


0.06 


0.85 


0.82 


0.83 


0 


0.06 


0.06 


0.06 


0.06 


0.06 


0.06 



5 Tables 10-12 show the affinity pxjrification of the respective antibodies 

to the three lung antigens, L523S, L763P and L763 peptide #2684. 



Table 10 



Antibody 
cone 
(|ig/ml) 


Affinity 
pure 
(salt 
peak) 


Affinity 
pure 
(salt 
peak) 


Average 


Affinity 
pure 
(acid 
peak) 


Affinity 
pure 

(acid 
peak) 


Average 


1.0 


238 


2.35 


2.36 


2.25 


2.31 


2.28 


0.5 


2.24 


2.22 


2.23 


2.19 


2.18 


2.18 


0.25 


2.05 


2.09 


2.07 


2.01 


2.03 


2.02 


0.13 


1.70 


1.81 


1.75 


1.74 


1.74 


1.74 


0.063 


1.44 


1.44 


1.44 


1.43 


1.38 


1.40 


0.031 


1.05 


1.05 


1.05 


0.99 


0.99 


0.99 


0.016 


0.68 


0.67 


0.68 


0.65 


0.64 


0.64 


0.0078 


0.43 


0.42 


0.42 


0.39 


0.39 


0.39 


0.0039 


0.27 


0.26 


0.27 


0.24 


0.26 


0.25 


0.0020 


0.18 


0.20 


0.19 


0.19 


0.18 


0.19 


0.0010 


0.13 


0.14 


0.13 


0.13 


0.14 


0.13 


0.00 


0.11 


0.12 


0.11 


0.10 


0.12 


0.11 
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Table 11 



Antibody 
dilution 


AfRnity 
pure 


Affinity 
pure 


AveracTA 


1:1000 


1 .OH 


1.77 


1.70 


1:2000 


1.59 


1.76 




1:4000 


1.48 


1.62 


1.55 


1:8000 


1.35 


1.43 


1.39 


1:16000 


1.09 


1.19 1 


1.14 


1:32000 


0.81 


0.89 


0.85 


1:64000 


0.55 


0.58 


0.56 


1:128000 


0.31 


0.35 


0.33 


1:256000 


0.18 


0.20 


0.19 


1:512000 


0.11 


0.12 


0.11 


1:1024000 


0.07 


0.07 


0.07 


1:2048000 


0.06 


0.06 


0.06 



Table 12 



Antibody 
cone. 
(|ig/mJ) 


Afiinity 
pure 


Afiinity 
pure 


Average 


1.0 


2.00 


2.02 


2.01 


0.5 


2.01 


1.93 


1.97 


0J5 


1.84 


1.83 


1.84 


0.13 


1.80 


^ 1.83 


1.81 


0.06 


1.39 


1.60 


1.50 


0.03 


1.33 


1.35 


1.34 


0.02 


0.94 


0.93 


0.94 


0.00 


0.06 


0.06 


0.06 



EXAMPLE 24 
HjLLjyPJGTH cDNA SKOI tfmpt; ENCODTNfi T sooq 
The isolation of a partial sequence (SEQ ID NO:106) for lung antigen 
L529S ^vas previously p„>vided in Example 2. This partial sequence was used as a 

163 



wo 02/00174 



PCT/USOl/21065 



query to identify potential fUll length cDNA and protein sequences by searching against 
publicly available databases. The predicted full-length cDNA sequence for the isolated 
cloned sequence of SEQ ID NO: 106 is provided in SEQ K> NO:442. The deduced 
aniino acid sequence of the antigen encoded by SEQ ID NO:442 is provided in SEQ ID 
5 NO:443. It was previously disclosed in Example 2 that L529S shows similarity to 
connexin 26, a gap junction protein. 

EXAMPLE 25 

EXPRESSION IN MEGATERIUM OF A HISTIDINE TAG-FREE L523S FUSION 
10 PROTEIN 

PGR was performed on the L523S coding region with the following 

primers: 

Forward prima: PDM-734 5' caatcaggcatgcacaacaaactgtatatcggaaac 3* 
15 (SEQ ID NO:444) Tm 63**C. 

Reverse primer PDM-735 5' cgtcaagatcttcattacttccgtcttgac 3* (SEQ ID 
NO:445)TM60°C. 

The PGR conditions were as follows: 
10^U lOXPfu buffer 
20 l.OnllOmMdNTPs 

2.0nl 10|jM each primer 
83^1 sterile water 

1 .Sfxl Pfu DNA polymerase (Stratagene, La JoUa, CA) 
SOrigDNA 

25 96^C for 2 minutes, 96°C for 20 seconds, 62°G for 15 seconds, 72^G for 

4 minute with 40 cycles and then 72''C for 4 minutes. 

The PGR product was digested with SphI and Bglll restriction enzymes, 
gel purified and then cloned into pMEG-3, which had been digested with SphI and Bglll 
restriction enzymes. The correct construct was conjSrmed by DNA sequence analysis 

30 and then transformed into Megaterium cells for expression. 
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10 



The amino acid sequence of expressed recombinant L523S is shovm in 
SEQ m NO:446, and the DNA coding region sequence is shown in SEQ ID NO:447. 



EXAMPLE 2fi 

EXPRESSION IN E. COU OF A HISTTDINE TAG-FREE L523S FUSION PROTEIN 



primers: 



PGR was petfoimed on the L552S coding region with the following 



•Forward primer PDM-733 5' cgtactagcatatgaacaaactgtatatcggaaac 3' 
(SEQIDNO:448)Tm64''C. 

Reverse primer PDM-415 5' ccatagaattcattacttccgtcttgactgagg 3' (SEQ 
IDNO:426)TM62"'C. 

The PGR conditions were as follows: 
^5 lOHllOXPfubuflFer 

I.OhI lOmMdNTPs 
2.0nl lO^M each primer 
83nl sterile water 

l.Sfil Pfii DNA polymerase (Stratagene, La Jolla, GA) 
20 SOrigDNA 

96»G for 2 minutes. 9&>C for 20 seconds. 62''G for 15 seconds, 72»G for 
4 minute with 40 cycles and then 72°G for 4 minutes. 

The PGR product was digested with Ndel and EcoRI restriction 
enzymes, gel purified and then cloned into pPDM, a modified pET28 vector, which had 
25 been digested with Ndel and EcoRI restriction enzymes. The conect construct was 
confinned by DNA sequence analysis and then transfonned into BLR pLys S and HMS 
174 pLys S cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:449, and the DNA coding region sequence is shown in SEQ ID NO-450 

30 
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EXAMPLE 27 

EPITOPE-ANALYSIS OF L514S AND L523S-SPECinC ANTIBODIES 

5 Peptides of candidate antigens can be used for tiie evaluation of antibody 

responses in botii preclinical and clinical studies. These data allow one to further 
confirm the antibody response against a certain candidate antigen. Protein-based ELISA 
with and without competitive peptides and peptide-based ELISA can be used to 
evaluate these antibody responses. Peptide ELISA is especially useful since it can 

10 further exclude the false positive of the antibody titer observed in protein-based ELISA 
as well as to provide the simplest assay system to test antibody responses to candidate 
antigens. In this example, data was obtained using both L514S- and L523S-peptides 
that show that individual cancer patients produce L514S- and L523S-specific 
antibodies. The L5 14S-specific antibodiesiecognize primarily the following epitope of 

15 L514S: 

aa86-110: LGKEVRDAKITPEAFEKLGFPAAKE (SED ID 

NO:451). 

This epitope is the common epitope in humans. A rabbit antibody 
specific for L514S recognizes two addition epitopes of L514S: 
20 (1) aa2M5: KASDGDYYTLAVPMGDVPMDGISVA (SEQ ID 

.NO:452) 

(2) aal21-135: PDRDVNLTHQLNPKVK (SED ID NO:453) 
It was further found that the SEQ ID NO:452 is common to both L514S 
isoforms, L514S-13160 and L514S-13166, whereas the other epitopes, SEQ ID NO:451 
25 and SEQ ID NO:453, are probably specific to the isofonn, L514S-13 160. 

The L523S-specific antibodies recognize primarily the following epitope 

ofL523S: 

aa440-460: lOAPAEAPDAKVRMVnTGP (SEQ ID NO:454). 
This epitope is the common epitope in humans. A rabbit antibody 
30 specific for L523S recognizes two other q)itopes: 
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(1) aal56-175 PDGAAQQNNNPLQQPRG (SEQ © 
NO:455) 

(2) aa326-345:RTrrVKGNVETCAKAEEEIM (SED ID 
NO:456) 

5 

In further studies, it was detennined by peptide based ELISAs that eight 
additional epitopes of L523S recognized by L523S-specific antibodies: 

(1) aa40-59 AFVDCPDESWALKAIEALS (seq id 
NO:457) 

(2) aa80-99: IRKLQIRNIPPHLQWEVLDS (SED ID 
NO:458) 

(3) aal60-179: AQQNPLQQPRGRRGLGQRGS (SEQ ID 
NO:459) 

(4) aal80-199: DVHRKENAGAAEKSimST (SED ID 

. J 5 NO:460) 

(5) aa320-339: LYNPERTITVKGNVETCAKA (SEQ ID 
NO:461) 

(6) aa340-359: EEEIMKKIRESYENDIASMN (SED ID 
NO:462) 

20 (7)aa370-389:LNALGLFPPTSGMPPPT^GP (SEQ ID 

NO:463) 

(8)aa380.399:KIAPAEAPDAKVRMVIITGP (SED ID 

NO:464) 

Out of these, six epitopes are common in both lung plural effusion fluid 
25 samples and in sera of lung patients. Of these six, SEQ ID NO:459 and SEQ ID 
NO:463 have no homology to other L523S-family proteins such as IGF-n mRNA- 
binding proteins 1 and 2. Accordingly, this indicates that these two peptides can be 
used as an assay system to determine the antibody response to L523S. 

30 
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EXAMPLE 28 

Generation of L523S-Specific CTL Lines Using In Vitro Whole-Gene Priming 

To detennine if L523S is capable of generating a CD8* T cell immune 
5 response, CTLs were generated using in vitro whole-gene priming methodologies with 
tumor antigen-vaccinia infected DC (Yee et al. The Journal of Immunology, 
157(9):4079-86, 1996), human CTL lines were derived that specifically recognize 
autologous fibroblasts transduced with the L552S tumor antigen, as determined by 
interferon-gamma ELISPOT analysis. Specifically, dendritic cells (DC) were 

10 differentiated from Percoll-purified monocytes derived from PBMC of normal human 
donors by plastic adherence and growing for five days in RPMI medium containing 
10% human serum, 50 ng/ml human GM-CSF and 30 ng/ml human IL-4. Following the 
five days of culture, the DC were infected overnight with a recombinant adenovirus that 
expresses L523S at a multipUcity of infection (M.O.I) of 33, 66 and 100, and matured 

15 overnight by the addition of 2 |ig/ml CD40 ligand. The virus was then inactivated by 
UV irradiation. In order to generate a CTL line, autologous PBMC were isolated and 
CD8+ T cells were enriched for by the negative selection using magnetic beads 
conjugated to CD4+, CD14+, CD16+, CD19+, CD34+ and CD56+ cells. CD8+ T cells 
specific for L523S were established in round bottom 96-well plates using 10,000 L523S 

20 expressing DCs and 100,000 CD8+ T cells per well in RPMI si^pplemented witii 10% 
human serum, lOng/ml of IL-6 and 5ng/ml of IL-12. The cultures were restimulated 
every 7-10 days using autologous primary fibroblasts retrovirally transduced with 
L523S, and the costimulatory molecule CD80 in the presence of E^-2. The cells were 
also stimulated with IFN-gamma to upregulate MHC Class I. The media was 

25 supplemented with lOU/ml of IL-2 at the time of stimulation as well as on days 2 and 5 
following stimulation. Following three stimulation cycles, ten L523S specific CD8+ T 
cell lines were identified using interferon-gamma ELISPOT analysis that specifically 
produce interferon-gamma when stimulated with the L523S tumor antigen-transduced 
autologous fibroblasts, but not with a control antigen. 
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One line. 6B1. was cloned using anti.CD3 and feeder cells. Tht clones 
were tested for specificity on L523S-transduced fibroblasts. In addition, using a panel 
of HLA-nusn^atched lines transduced with a vector expressing L523S and measuring 
interferon-g^ pn,duction by this CTL line in an ELISPOT assay, it was determined 
5 that this clone 6B1.4B8 is restricted by HLA-A0201. ' 

Also u^g transfected Cos ceUs. it was shown that clone 6B1 4B8 
recognizes Cos cells transfected with pcDNAS HLA A0201/L523S in an HLA- 
restricted and antigen specific manner. 

An epitope mapping study demonstrated the clone 6B1.4B8 recognizes 

10 HLA-A201LCLloadedwithpeptidepool3(apol3peptideconespondingtoa^^ 
positions 33-59 of L523S. 

A peptide pool breakdown study demonstrated that clone 6B1 4B8 
recogmzes autologous B-LCL loaded with 15-mer peptides from amino acid positions 
37-55 of L523S, TGYAFVCPDESWALKAIE (SEQ ID NO:465). A fimher peptide 
15 breakdown study demonstrated that clone 6B1.4B8 recognizes T2 cells loaded with the 
same 15-mer peptides. 

A peptide recognition study demonstrated that clone 6B1.4B8 prefere T2 
cells loaded with the peptide FVDCPESWAL (SEQ ID NO:466) which is corresponds 
to the ammo add sequence at positions 41-51 of L523S and is encoded by the DNA 
20 sequence of SEQ ID NO:467. 



25 



EXAMPI.F.:>0 
L523S EXPRESSION IN OTHER HUMAN CANCERS 

It was previously disclosed in Example 2 that L523S is expressed in lung 
cancers including squamous, adenocarcinoma and small cell carcinoma EST profiling 
analyisis of L523S fimher indicates that this protein may also be expressed in a number 
other tumor tjpes. inclduing colon adenocarcinomas, prostate adenocarcinomas CML 
AML. Burkitt's Lymphoma, brain tumors, retinoblastomas, ovarian tumors. 
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teratocarcinomas, uterus myosarcomas, genn cell tumors as well as pancreatic and 
cervical tumor cell lines. 

From the foregoing it will be appreciated that, although specific 
5 embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating fi-om the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 



1. An isolated polynucleotide comprising a sequence selected from 
the groiq) consisting of: 

(a) sequences provided in SEQ ID NOs:351, 353, 358 362 364 
366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467; 

(b) complements of the sequences provided in SEQ ID NOs:351 
353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467;' 

(c) sequences consisting of at least 10 contiguous residues of a 
sequence provided in SEQ ID NOs:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 
424, 428, 43 1 , 434, 442, 447, 450 and 467; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
NOs:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447. 
450 and 467, under highly stringent conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NOs:351, 353, 358, 362, 364. 366, 368, 370-375. 420. 424. 428, 431, 434, 442 447 
450 and 467; ' ' ' 

(f) sequences having at least 90% identity to a sequence of SEQ ID 
NOs:351. 353, 358. 362. 364. 366, 368, 370-375, 420. 424. 428. 431. 434. 442, 447 
450 and 467; and 

(g) degenerate variants of a sequaice provided in SEQ ID NOs:35 1 
353, 358, 362. 364. 366. 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467.' 

2. An isolated polypeptide comprising an amino acid sequence 
selected from flie group consisting of: 

(a) sequences having at least 90% identity to a polypeptide having an 
amino acid sequence of any one of the sequences provided in SEQ ID NOs:352 354 
357, 361, 363. 365. 367, 369, 376-382, 387-419. 423. 427. 430. 433, 441, 443, 44^, 449' 
and 451-466; 
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(b) sequences encoded by a polynucleotide of claim 1 ; 

(c) sequences having at least 70% identity to a sequence encoded by 
a polynucleotide of claim 1 ; and 

(d) sequences having at least 90% identity to a sequence encoded by 
a polynucleotide of claim 1 . 

3. An expression vector comprising a polynucleotide of claim 1 
operably linked to' an expression control sequence. 

4. A host ceU transformed or transfected with an expression vector 
according to claim 3. 

5. An isolated antibody, or antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 2. 

6. A method for detecting the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample v^ith a binding agent that binds 
to a polypeptide of claim 2; 

(c) detecting in the sample an amount of polypeptide that binds to 
the binding agent; and 

(d) comparing the amount of polypeptide to a predetermined cut-off 
value and therefi'om determining the presence of a cancer in the patient. 

7. A fusion protein comprising at least one polypeptide according to 

claim 2. 
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8. A fusion protein according to claim 9, wherein the fusion protein 
is selected from the group consisting sequences provided in SEQ JD NOs:352, 354, 423 
427, 430 and 433. 



9. An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NOs:351, 353, 358. 362, 364, 366. 368. 370-375. 420. 424, 428, 43 1, 434. 442. 447. 
450 and 467 under highly stringent conditions. 

10. A method for stimulating and/or expanding T cells specific for a 
tumor protein, comprising contacting T ceUs xvith at least one component selected from 
the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1 ; and 

(c) antigen-presenting cells that express a polynucleotide according 

to claim 1, 

under conditions and for a time sufBcient to permit the stimulation 
and/or expansion of T cells. 

11. An isolated T cell population, comprising T ceUs prepared 
according to the method of claim 10. 

12. A composition comprising a first component selected from the 
group consistmg of physiologically acceptable carriers and immunostimulants, and a 
second component selected from the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1 ; 

(c) antibodies according to claim 5; 

(d) fiision proteins according to claim 7; 

(e) T cell populations according to claim 1 1; and 
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(f) antigen presenting cells that express a polypeptide according to 

claim 2. 

13. A method for stimulating an immune response in a patient^ 
comprising administering to the patient a composition of claim 12. 

14. A method for the treatment of a lung cancer in a patient, 
comprising administering to the patient a composition of claim 12. 

15. A method for determining the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 
according to claim 9; 

(c) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; and 

(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence of the cancer in the patient. 

16. A diagnostic kit comprising at least one oligonucleotide 
according to claim 9. 

17. A diagnostic kit comprising at least one antibody according to 
claim 5 and a detection reagent, wherein the detection reagent comprises a reporter 
group. 
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18. A method for the treatment of lung cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient 
with at least one component selected from the group consisting of: (i) polypeptides 
according to claim 2; (ii) polynucleotides according to claim 1; and (iii) antigen 
presenting ceUs that express a polypeptide of claim 2, such that T cell proliferate; 

(b) administering to the patient an effective amount of the 
proliferated T cells, 

and thereby inhibiting the development of a cancer in the patient. 
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SEQUENCE LISTING 

<110> Corixa Corporation 
Wang, Tongtong 
Marnerakis , Margarita 
Fanger, Gary R. 
Vedvick, Thomas S^ 
Carter, Darrick 
Watanabe, Yoshihiro 
Henderson, Robert A. 
Peckham, David W. 
Fanger, Neil 



<120> COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF LONG CANCER 



<130> 210121. 45502PC 

<140> PCT 

<141> 2001^06-28 



<160> 467 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 315 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 236, 241 
<223> n = A, T, C or G 



<400> 1 

gcagagacag actggtggtt gaacctggag 

cagctgccgt gagactcccg atgtcacagg 

ttcatctcca gcagagacaa cggaggaggc 

gttaatatgt ttgtaaactc atgtacagtt 

naaattacaa atagaatcat ttgctgtaat 
aaaaaaaaaa aaaaa 

<210> 2 
<211> 380 
<212> DNA 

<213> Homo sapiens 



gtgccaaaaa agccagctgc gggcccagga 60 
cagtctgtgt ggttacagcg cccctcagtg 120 
tcccaccagg acggttctca ttatttatat 180 
ttttttgggg gggaagcaat gggaanggta .240 
ccttaaatgg caaacggtca ggccacgtga 300 

315 



<400> 2 

atttaggctt aagattttgt ttacccttgt 
atatatataa acaaatacaa aaagttttga 
cataactttt aacaacactg ctctgtaatg 
ctgaaatgag tggatgtata gtgttattgc 
ggataaattc ccagtctaga ttattagcct 



tactaaggag caaattagta ttaaagtata 60 
gtggttcagc ttttttattt tttttaatgg 120 
ggttgaactg tggtactcag actgagataa 180 
ataattatcc cactatgaag caaagggact 240 
ttgttaacca tcaagcacct agaagaagaa 300 
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Saala^::: aaaaSSaa '^'^^^'^^'^ ^^'^"^^^^^ ^^-"^^^ "^^^-ttt 360 

380 

<210> 3 

<211> 346 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

tllTsll'/lls' '''' '''' '''' '''' 331. 336, 337, 339, 

<223> n = A,T,C or G 

<400> 3 

SS iE= iSS SS SiS SS i: 

<210> 4 

<211> 372 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 297, 306, 332 
<223> n « A,T,C or G 

<400> 4 

H= EiS ^ ~ « So 

™- ~- sss?s SIS 3io° 

372 

<210> 5 

<211> 698 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> iaisc__feature 

Sr=!j/fe.1^i/g;/L^;/--||j/-; -l' 'II' -J 

tV.: Ill: If,: ill: e.; L'^: IJI: ^15; SI; 

<223> n = A,T,C or G 
<400> 5 

== is S Hi- == Si£ i 
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caatacacac tcatgaactc ctgatggaac aataacaggc ccaagcctgt ggtatgatgt 240 
gcacacttgc tagactcaga aaaaatacta ctctcataaa tgggtgggag tattttgggt 300 
gacaacctac tttgcttggc tgagtgaagg aatgatattc atatnttcat ttattccatg 360 
gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 
tntccaaatn ttngtncngt cgctgcacat atctgaaatc ctatattaag antttcccaa 480 
natgangtcc ctggtttttc cacgccactt gatcngtcaa ngatctcacc tctgtntgtc 540 
ctaaaaccnt ctnctnnang gttagacngg acctctcttc tcccttcccg aanaatnaag 600 
tgtgngaaga nanccncncn cccccctncn tncnncctng ccngctnnnc cncntgtngg 660 
gggngccgcc cccgcggggg gacccccccn ttttcccc 698 

<210> 6 

<211> 740 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 82, 406, 426, 434, 462, 536, 551, 558, 563, 567, 582, 584, 
592, 638, 651, 660, 664, 673, 675, 697, 706, 711, 715, 716, 
717, 723, 724, 725, 733 
<223> n = A,T,C or G 

<400> 6 

actagtcaaa aatgctaaaa taatttggga gaaaatattt tttaagtagt gttatagttt 60 
catgtttatc ttttattatg tnttgtgaag ttgtgtcttt tcactaatta cctatactat 120 
gccaatattt ccttatatct atccataaca tttatactac atttgtaaga gaatatgcac 180 
gtgaaactta acactttata aggtaaaaat gaggtttcca agatttaata atctgatcaa 240 
gttcttgtta tttccaaata gaatggactt ggtctgttaa ggggctaagg gagaagaaga 300 
agataaggtt aaaagttgtt aatgaccaaa cattctaaaa gaaatgcaaa aaaaaattta 360 
ttttcaagcc ttcgaactat ttaaggaaag caaaatcatt tcctanatgc atatcatttg 420 
tgagantttc tcantaatat cctgaatcat tcatttcagc tnaggcttca tgttgactcg 480 
atatgtcatc tagggaaagt ctatttcatg gtccaaacct gttgccatag ttggtnaggc 540 
tttcctttaa ntgtgaanta ttnacangaa attttctctt tnanagttct tnatagggtt 600 
aggggtgtgg gaaaagcttc taacaatctg tagtgttncg tgttatctgt ncagaaccan 660 
aatnacggat cgnangaagg actgggtcta tttacangaa cgaatnatct ngttnnntgt 720 
gtnnncaact ccngggagcc 740 

<210> 7 

<211> 670 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

tc222> 265, 268, 457, 470, 485, 546, 553, 566, 590, 596, 613, 624, 
639, 653, 659, 661 
<223> n = A,T,C or G 



<400> 7 

gctggggagc tcggcatggc ggtccccgct 
agcggccccg gctcgatggc cccgtggtgc 
cttgggatgc aggagctgtt ccggggccac 
ccaaggtgca ctcggtggcc tggagttgcg 
aagacgccac gtcttcttgc tgganaanga 
catggggata gtgtggacca ctttgttggc 
cgtctggaga taaaaccatt cgcatctggg 
tgaacactaa aggggagaac attaatatct 
tagcnacaag gatgatgtgg tgactttatt 



gcagccatgg ggccctcggc gttgggccag 60 
tcagtgagca gcggcccgtc gcgctacgtg 120 
agcaagaccg cgagttcctg gcgcacagcg 180 
acgggcgtcg cctacctcgg ggtcttcgac 240 
ccgttggtca aagaaaacaa ttatcgggga 300 
atccaagtaa tcctgaccta tttgttacgg 360 
atgtgaggac tacaaaatgc attgccactg 420 
gctggantcc tgatgggcan accattgctg 480 
gatgccaaga aaccccgttc caaagcaaaa 540 
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llltT^lt r^Tol^^^^ StelLT^ ^^'^^^^^^^ ^^^^^^"^^ 

natccacccc i:gi:m:cacat cctcagctnc cccaaaactg aancctgtnc 660 

670 

<210> 8 

<211> 689 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

«; p; ^^5: ^li S?;' S i 1 - 

<223> n = A, T, C or G 
<400> 8 

SS Eisi i~ So 

gatatatntt cctagtgg?? tTacTtnll IXtlf"'' ^"^^^^^tn tccaggctgt 420 
cnntnctncc nntc^c?cnn cnrtcccccc cScn^o. gtttantttt ctccccccnn 480 
cccccncggn ggacccccct ttaatccct^ Z^ ^^ tccnnnnttn gggggggccn 540 
nggccntann ??tccccSn nS2a?S^ f^^"" natggcccct ggnnttatcc 600 

aagcctaagt ttntaccSg gggSS ccccctccca ntcccnccac ctcaanccgg 660 

<210> 9 

<211> 674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 
<222> 602, 632, 639, 668 
<223> n - A,T,C or G 

<400> 9 

irxtir. iiiir''^' "^^^^^^^^ «o 

gaaaaaagcg aggctttttt Tacaccttll tttl '^''^ cccaaggcag caagataact 120 

ataagcctga ag^gaagtag ct^J^act ItlHH^l TtlTtlT ^^^^-"^^^ "° 
ccttcatgga aaaaggcttc ctataa^aa^ ttccattttt cttagttctc ccaataggct 240 
ttctgaaa?a agagSaaat SSccSaa tTt^^'t^ tgaattagca gtgtgattat 300 
aaagttttgt t?g^c??ca clllZalat f ^^^^9 tgatttaaaa taaacaaccc 360 
caaaaacatt agc5?"?g tc?ttcaatt IT.nlt^ff Sgtatgttgt tgaagacatt 420 
agttaattac t?tg?tctgg alllTgcTtl IttTrltl J'T^"^'^ cctccatgtg 480 
catctgaata atattgtgga tttc^^cS" ?gc?^g;2c u'cttttf T^T^-* ^40 
anaaatgtca aaaaaaaagg tcgatctact cnar^™ ttcttttgac tcctctggga 600 
aggacccnct gccc ^^e^^ctact cngcaaggnc catctaatca ctgcgctgga 660 

674 

<210> 10 
<211> 346 
<212> DNA 
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<213> Hcsmo sapiens 
<220> 

<221> misc_feature 

<222> 320, 321, 322, 325, 326, 328, 329, 330, 332, 333, 334, 335, 
342 

<223> n = A,T,C or G 
<400> 10 

actagtctgc tgatagaaag cactatacat cctattgttt ctttctttcc aaaatcagcc 60 
ttctgtctgt aacaaaaatg tactttatag agatggagga aaaggtctaa tactacatag 120 
ccttaagtgt ttctgtcatt gttcaagtgt attttctgta acagaaacat atttggaatg 180 
tttttctttt ccccttataa attgtaattc ctgaaatact gctgctttaa aaagtcccac 240 
tgtcagatta tattatctaa caattgaata ttgtaaatat acttgtctta cctctcaata 300 
aaagggtact tttctattan nnagnngnnn gnnnnataaa anaaaa 346 

<210> 11 

<211> 602 

<212> DNA 

<213> Homo sapiens 

<400> 11 

actagtaaaa agcagcattg ccaaataatc cctaattttc cactaaaaat ataatgaaat 60 

gatgttaagc tttttgaaaa gtttaggtta aacctactgt tgttagatta atgtatttgt 120 

tgcttccctt tatctggaat gtggcattag cttttttatt ttaaccctct ttaattctta 180 

ttcaattcca tgacttaagg ttggagagct aaacactggg atttttggat aacagactga 240 

cagttttgca taattataat cggcattgta catagaaagg atatggctac cttttgttaa 300 

atctgcactt tctaaatatc aaaaaaggga aatgaagtta taaatcaatt tttgtataat 360 

ctgtttgaaa catgagtttt atttgcttaa tattagggct ttgccccttt tctgtaagtc 420 

tcttgggatc ctgtgtagaa ctgttctcat taaacaccaa acagttaagt ccattctctg 480 

gtactagcta caaattcggt ttcatattct acttaacaat ttaaataaac tgaaatattt 540 

ctagatggtc tacttctgtt catataaaaa caaaacttga tttccaaaaa aaaaaaaaaa 600 

aa 602 

<210> 12 

<211> 685 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 170, 279, 318, 321, 322, 422, 450, 453, 459, 467, 468, 470, 
473, 475, 482, 485, 486, 491, 498, 503, 506, 509, 522, 526, 
527, 528, 538, 542, 544, 551, 567, 568, 569, 574, 576, 582, 
587, 588, 589, 590, 592, 593, 598, 599, 603, 605, 608 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 633, 634, 635, 644, 646, 648, 651, 655, 660, 662, 663, 672, 
674, 675, 682, 683 
<223> n « A,T,C or G 

<400> 12 

actagtcctg tgaaagtaca actgaaggca gaaagtgtta ggattttgca tctaatgttc 60 
attatcatgg tattgatgga cctaagaaaa taaaaattag actaagcccc caaataagct 120 
gcatgcattt gtaacatgat tagtagattt gaatatatag atgtagtatn ttgggtatct 180 
aggtgtttta tcattatgta aaggaattaa agtaaaggac tttgtagttg tttttattaa 240 
atatgcatat agtagagtgc aaaaatatag caaaaatana aactaaaggt agaaaagcat 300 



wo 02/00174 



6 



PCT/USOl/21065 



agaStI? ctlSgtlgt'g Tct^Zmt S^^^^^^ to^.^.t.,. tgccaggcag 360 
angtagigic ct?g?aggtg ^cacataa^n J^^f tgaagaactt ccctcaogtg 420 

ancanng?ga nartttcncc atn^f tantggganc aggccgimcn gtnanaagaa 480 
cntntcSS TXlTc,: TttllclZl SSa^fS r^^^^^- gctcccaLa 540 
cantntgnta accccgcgcc JaoaWn?^ tccngnaacc tngccgnnnn cnngcccnnc 600 
cnnccg^^gt cncnnccSg cScc^ tcnnntcgtt ctcncncnaa ngggntttcn 660 

685 

<210> 13 

<211> 694 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

679^61?''''''' '''' 658, 662, 676, 

<223> n = A, T, C or G 

<400> 13 

iT^zii iiiisisi sssj 
.'SXM s« s= £• 

2= ~ bEi i?"- 

694 

<210> 14 

<2H> 679 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

400, 441, 416, 461, 492 , 504 , 505 512 525 ' 527 II,' 1^.' 

<2i;>'r: Sc'i^."°• ■ 

<400> 14 

cagccgcctg catctgtatc cagcgocang tcccaccsaf nr-r-r.^^^- 
agtcccgnac ccgttcggcc cangrtnag? SncctSc catncSa^^ cgcgcccccc 60 
ccaagtgcat caaatacctg cngtncgg^t nKaattcat ^^^^^^^^^ aaaggangca 120 
ctgtccntgc cattggacta nggctccgat ncaactc^n! cttctggctt gccgggattg 180 
naganactaa tnatnattnt tccagSS aScaaoaat IITITT ^^cttcganc 240 
gcnccctcnt gatgctggtg ggcttcctSa IZZll^^ ctatattctg atcggatccg 300 
gcatgctggg actgtt??tc SgStcStS Ifalf^l If^f^^ gagtcccant 360 
ccatctgggg atattccact ncgatnatgt gSlaaaaa n?^^ atacctgcgg 420 

acacgtacaa cnacctgaaa acLnggaJg SccccScg oaaaS^^ ttttacaagg 480 
actatgcgtt gaactgcaat ggtttggctg Sgncct?q! fcl^^tt ^^"^^'^^^^'^ ^40 
tggccccann aaaggacntn ctcgaL^cc? HT^iT^. g™^^ ^ 
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7 



cagaagtctc gaacaatcc 679 

<210> 15 

<211> 695 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 

<222> 105, 172, 176, 179, 189, 203, 212, 219, 221, 229, 231, 238, 
242, 2.61, 266, 270, 278, 285, 286, 298, 311, 324, 337, 350, 
363, 384, 391, 395, 405, 411, 424, 427, 443, 448, 453, 455, 
458, 463, 467, 470, 479, 482, 484, 493, 499, 505, 518 
<223> n = A,T,C or G 

<221> misc_f eature 

<222> 520, 523, .531, 540, 584, 595, 597, 609, 611, 626, 628, 651, 
652, 657, 661, 665, 669, 672, 681, 683, 691, 693 
<223> n - A,T,C or G 



<400> 15 

actagtggat aaaggccagg gatgctgctc aacctcctac catgtacagg gacgtctccc 60 
cattacaact acccaatccg aagtgtcaac tgtgtcagga ctaanaaacc ctggttttga 120 
ttaaaaaagg gcctgaaaaa aggggagcca caaatctgtc tgcttcctca cnttantcnt 180 
tggcaaatna gcattctgtc tcnttggctg cangcctcanc ncaaaaaanc ngaactcnat 240 
cnggcccagg aatacatctc ncaatnaacn aaattganca aggcnntggg aaatgccnga 300 
tgggattatc ntccgcttgt tgancttcta agtttcnttc ccttcattcn accctgccag 360 
ccnagttctg ttagaaaaat gccngaattc naacnccggt tttcntactc ngaatttaga 420 
tctncanaaa cttcctggcc acnattcnaa ttnanggnca cgnacanatn ccttccatna 480 
ancncacccc acntttgana gccangacaa tgactgcntn aantgaaggc ntgaaggaan 540 
aactttgaaa ggaaaaaaaa ctttgtttcc ggccccttcc aacncttctg tgttnancac 600 
tgccttctng naaccctgga agcccngnga cagtgttaca tgttgttcta nnaaacngac 660 
ncttnaatnt cnatcttccc nanaacgatt ncncc 695 

<210> 16 
<211> 669 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 299, 354, 483, 555, 571, 573, 577, 642, 651, 662, 667 
<223> n = A, T,C or G 



<400> 16 

cgccgaagca gcagcgcagg ttgtccccgt 
ttcccgggcc ccttacactc cacagtcccg 
agaaccctgc ggaggagacc ggcgaggaga 
tgcctgagag agctgaagag gcaaagctaa 
ctggaggctc cgacttcctc atgaagagac 
gagactacaa catggccaaa gccaacatga 
acaagaacct ggtgactggt gatcacatcc 
ctcgctcgtc accagcaagc ttgcgggtgg 
canatctgag acgcttccct ccctgcccca 
tgcttttgca gccangggtc aggaagtggc 
cctgttggtg tcccacccat ggagcccctg 
tntcttncc 



ttcccctccc ccttcccttc tccggttgcc 60 
gtcccgccat gtcccagaaa caagaagaag 120 
agcaggacac gcaggagaaa gaaggtattc 180 
aggccaaata cccaagccta ggacaaaagc 240 
tccagaaagg gcaaaagtac tttgactcng 300 
agaataagca gctgccaagt gcangaccag 360 
ccaccccaca ggatctgccc agagaaagtc 420 
ccaagttgaa tgatgctgcc ggggctctgc 480 
cccgggtcct gtgctggctc ctgcccttcc 540 
ncnggtngtg gctggaaagc aaaacccttt 600 
gggcgagccc angaacttga ncctttttgt 660 

669 



wo 02/00174 
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<210> 17 

<211> 697 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



iilpiiiiiip-" 

<221> misc^fieature 

if lli'iif I ii i £■ - 

<221> misc_feature 

<223> n - A,T,C or G 
<400> 17 

gcctgcccL gggaJCcc^ ScS^^ ccStnt^^ cacacgggga tcntggtnat 120 
ncctggctcn cnCTgcccna nc~S^ cccatntcac acccgnnccn tncgcccacn 180 
cncnSctcc ncSnacc? ScSc™ actcc^f g-cnnnctcn ttnncntctc 240 
ccacnacncc ntcnncncga anoJccnctc allttrr.n'''' ^^^ccccccc ccgcaancct 300 
ononacnncg cgntccccca SSS^™^ gcnctcngcc ccngccccct gccccocgoc 360 
agncacgcn? t^cgcccnc? aaSc™ f ^"^^^^^^ cccacnacag ncncacocgc 420 
ccccgcLnc nccnctgcnc ?ccan^ -tcaccttcat ggnccnacng 480 

ccccngcngn angcng?gca cnnS^nn ^^^^^^^^^ccc cnnccgngtn ccncncgnng 540 
cgcccgctgg gggcScac c™^f! gngccgnncn ncaccctccg nccnccgccc 600 
cnncn^tcL' c^SS^Sc iT^^^T'' ^^'^^'^^^'^^^ 

697 

<210> 18 
<211> 670 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

tfe%ir/2?/<,li:\tT/,il'' »«. =«. «o, 5„, 

<223> n = A,T,C or G 



<400> 18 



SScSS '.nZtlll SSga^^ fJSSr^ ^'^-^^^-^ ctt 60 

gggacggctg cccgccgggc cccSaa^^ agacatggcc cagaacttga acgacttggc 120 

«t=.,,.. e«,i??s s~ ™ 
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aatctcctcc ctacaggctc caaagaccta 

tctcgaccaa tgctcangaa cttcctaaca 

gaacgantgt tgccgtccat tgtcacgaag 

gncctcacnn ctgatcnccc agcggggcca 

acnnaaaagg gccaaggact tcccctcatc 
tttanccacc 

<210> 19 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 506 

<223> n = A,T,C or G 



cagatggtga atatctccct gcgagtgttg 420 
tgttccancg cctaagggct ggactacnaa 4 80 
tgctcaagaa tttnggtggc caagttcaat 540 
agttanccct ggttgatccc cgggganctg 600 
ctggataatg tggccntcac aaagctcaac 660 

670 



<400> 19 

actagtgcca acctcagctc ccaggccagt 
tggcctcagt tgtccttggt tattgatggg 
tgtcgccttg gctcaactgt ggttgatttg 
ccaggctgtg ccctggaaag tactacagcc 
tcacatgcgt cctacctgtg aaactctggg 
tactatgtgt ctgtccactg acgactgtca 
gggcactagc ctgactttta aggcagtgtg 
gagctgctgg tttagccttg cacctgggga 
cagccaaaag ctgaatggaa aagttnagaa 
tcttctgtct gttttgtttt tcaattgaaa 
gagacc 

<210> 20 

<211> 449 

<212> DNA 

<213> Homo sapiens 



tctctgaatg tcgaggagtt ccaggatctc 60 
ggacaaattg gggatggcca gagccccgag 120 
tctgtgcccg gaaagtttgg catcattcgt 180 
atcctccaac agaagtacgg actgctcccc 240 
aagcaggaag gcccaagacc tggtgctgga 300 
aggcctcatt tgcagaggcc accggagcta 360 
tctttctgag cactgtagac caagcccttg 420 
aaggatgtat ttatttgtat tttcatatat 4 80 
cattcctagg tggccttatt ctaataagtt 540 
agttattaaa taacagattt agaatctagt 600 

606 



<400> 20 

actagtaaac aacagcagca gaaacatcag 
cagcgccaga gccgaggaga acccccgctc 
ccaccacagc cgcctgccag gatggactcg 
tgccagaaca tcaaggagtt cactgcccaa 
cttcaagaat acaacaacta agaaaaggaa 
tgaagtcaca ccagggcaac tcttggaaga 
atttctttag tgtcattgcc gattttggct 
aaaacaaaat cttgactgct tgctcaaaa 

<210> 21 

<211> 409 

<212> DNA 

<213> Homo sapiens 



tatcagcagc gtcgccagca ggagaatatg 60 
cctgaggagg acctgtccaa actcttcaaa 120 
ctgctcattg caggccagat aaacacttac 180 
aacttaggca agctcttcat ggcccaggct 240 
gtttccagaa aagaagttaa catgaactct 300 
aatatatttg catattgaaa agcacagagg 360 
ataacagtgt ctttctagcc ataataaaat 420 

449 



<400> 21 

tatcaatcaa ctggtgaata attaaacaat 
caatgataaa aggaacaagc tgcctatatg 
tatgttgagt gaaagaacaa acacggagaa 
acagaaataa aaacagaggc aaccaccttt 
aaggaaggaa ggaaactcta cgctgatgga 
tgtggggata tacatttgtc aaaatttatt 
ttgggatgta aataatacct caattaaaaa 



gtgtggtgtg atcatacaaa gggtaccact 60 

tggaacaaca tggatgcatt tcagaaactt 120 

catactatgt ggttctcttt atgtaacatt 180 

gaggcagtat ggagtgagat agactggaaa 240 

aatgtctgtg tcttcattgg gtggtagtta 300 

gaactatata ctaaagaact ctgcatttta 360 

gacaaaaaaa aaaaaaaaa 409 
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<210> 22 
<211> 649 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 263, 353, 610, 635, 646 

<223> n = A,T,C or G 

<400> 22 



St'L'::." ii^^i^ii ssjjsi? itit^ttTtt :r"S" » 
2«S iizizii s~ f i-s t^z J^S 

ssss S Eli - 

gttgaagcag ggtgaaLac ^SggSa ^SSSSf Jtl'^f cgttgacagt 480 
gatgttttct ttggaatttc caaataaat^ ttttttgtaa gctgtttcat 540 

ct,a.,tta, ^:sst\T 12? 



<210> 23 
<211> 669 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 642, 661 
<223> n = A, T, C or G 

<400> 23 



tatcctctga cagcc?ttgg grtgcrtS! ^c^^!"''"^ aagatgtcag gaatgggatc 120 
tcacctgtcg tgLcccc?c rr^f'^r '^^'^^^cagga ggaggtgaca 180 

cgcaaggtgg tgctgatgc^ Tgcllll^ ZTtolTj. ^""^^^^^^ ggtggagact 240 
ctgacacttc tgctgaagtt aaaaaao^,; ^^^^''^^^^'3 aggagggagt caaacaccac 300 
ccaaatgaga atltlcolU SSt TalataT'' "^^^^^^^^^ tgacctgatg 360 
gctgaccaga gccggttgac ?tSSf ^^^^^^^^^^^ agctgggctt cattagtgag 420 
ggaacagtac cctcLct'S gcScSt^ ccSS^c -attttgcca 480 

gccctgatct gcgctgtggc tgtortaa^ ttagagctca ctcgggccag 540 

agtattacct gtgaagcS? tcccSf^ ^f^?"^"" ctctgtcctt ccccccagtc 600 
nttctaacc tccctccttt attattcagg anggctgggg gggotcoitg 



i <210> 24 
<211> 442 
<212> DNA 

<213> Homo sapiens 
<400> 24 



660 
669 



tZlTolTt otllilTclt SgSSi: aSr^^'^^ ^^^^"^^^^ -^^^--^ «0 
gatgactatc attatt^tag ^SStSat tJaf^^'^''^ ttcatgattt actttttcca 120 
cttacgatgc aettt.ctcc^ agSS^I S:-??:- -™ ^ 
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ggtaatgcac ttgctagtac tacacacttt 
cggaaagaga aaagccttcc tttgttggcc 
gatgatctct gacgatacct gtatgttctt 
gacctaaaaa aaaaaaaaga aa 

<210> 25 
<211> 656 
<212> DNA 

<213> Homo sapiens 



ggtacaacaa aaaacagagg caagaaacaa 300 
cttaaactga gtcaagatct gaaatgtaga 360 
attgtgtaaa taaaattgct ggtatgaaat 420 

442 



<220> 

<221> misc_feature 

<222> 330, 342, 418, 548, 579, 608 

<223> n = A,T,C or G 



<400> 25 

tgcaagtacc acacactgtt tgaattttgc 
ccccggaatg tacagtgtct tggtgcacca 
accctaatgg ggcagagagt atagccctag 
aggcctgagg tagaggggag tggtatgtgt 
gacaggatgt tagataaagg ctctagttag 
ctcctagcag ctggtaaagg ggtgctggan 
gggctgatct gattacttcc tggcatcccg 
atgggacagt tttccatatc cttgctgtgg 
attaaaaatc actgccctaa ctacacttcc 
tgacatantt cttggcatgg ggagccagcc 
ctcctganac tcatctacat agaattggtt 

<210> 26 

<211> 4 34 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 395 

<223> n = A,T,C or G 



acaaaaagtg actgtaggat caggtgatag 60 

agatgccttc taaaggctga cataccttgg 12 0 

cccagtggtg acatgaccac tccctttggg 180 

tttctcagtg gaagcagcac atgagtgggt 240 

ggtgtcattg tcatttgaga gactgacaca 300 

gccatggagg anctctagaa acattagcat 360 

ctcactttta tgggaagtct tattagangg 420 

agctctggaa cactctctaa atttccctct 480 

tccttgaagg aatagaaatg gaactttctc 540 

acaaatgana atctgaacgt gtccaggttt 600 

aaaccctccc ttggaataag gaaaaa 656 



<400> 26 

actagttcag actgccacgc caaccccaga aaatacccca catgccagaa aagtgaagtc 60 
ctaggtgttt ccatctatgt ttcaatctgt ccatctacca ggcctcgcga taaaaacaaa 120 
acaaaaaaac gctgccaggt tttagaagca gttctggtct caaaaccatc aggatcctgc 180 
caccagggtt cttttgaaat agtaccacat gtaaaaggga atttggcttt cacttcatct 24 0 
aataactgaa ttgtcaggct ttgattgata attgtagaaa taagtagcct tctgttgtgg 300 
gaataagtta taatcagtat tcatctcttt gttttttgtc actcttttct ctctaattgt 360 
gtcatttgta ctgtttgaaa aatatttctt ctatnaaatt aaactaacct gccttaaaaa 42 0 
aaaaaaaaaa aaaa 434 

<210> 27 

<211> 654 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 505, 533, 563, 592, 613, 635, 638 
<223> n A,T,C or G 
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<400> 27 



actagtccaa cacagtcaga aacattgttt taaatcr-i-ni- 

taataaacca ggatccat?t aggtaccact tS^aSaaa t^n^T^^ gcattaatct 60 
tttatactgc atcctttaca t?agccacta 2taS?ta? taatgaatat 120 

cagaatccta tggattgcag cat?tcactt ggrtS^ca SS^ff agacctttca 180 
gcagtttctc aaaagcagaa acatgccgcc ag^^SSaaa S^f ^ ttaaagaggg 240 
gaatgtaagg gcagctgocc occaato^^^^ ! ^ ttttcctcct aactccattt 300 

ttcttgttS cggctaS^g ^^^tttct^ llttlTc^tl L'^t^^l' -"-catt 360 
gtgttgattt acaaagagqc caoctaataa It^lT^ t ^^^tccgatc tttcccaaag 420 
attcaagctg tgagccaggc agga^cJcao tltaT gagagatgaa 480 

ggtacaaaaa aUtttllt gSatg? ^Sca??? L^L'^^"" tcngccattt 54 0 
aattgttaag aanaatttta a^gtccaj ^Sna^l^ tUTel'^, t^ir^'^' ^ 

<210> 28 

<211> 670 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

E^~S£^'^'^'^'^'^' - 

<400> 28 

gtalg'gS a\%1?Sg\^ ^llllZTot ^'^^^^ ^^'^^^^^ 

aggcagctta ttcgaactct gcggcSgcgg cJacaaaaco tcaacatcca 120 

gttcccggtg ctcctggtgt ctctctJoac gcggggtcoc tgctcccggc 180 

gtggggccag ctccccccgc ggSccSS cScS ^Sff " -ttctgagc 240 
aggaagatca ttagttcttt aaoaacottn «f f ^ ccatgctccc ggaaatcgag 300 
tatagggaat gtgggaaatc SgSS ?n?SJS f f^^^ctga aaaacactca 360 
ttgccaaaat ^tlllcaatc SSccSc cn^^f tntgatttct tgtgttttat 420 
tagtccgtct icacacacag aatSgaaaa caac« caaaaatcgg acntcngctt 480 

tattactaan ttttttctgt tggSJaaa aJ:atT''° ^^''^'^^"t tnantttnat 540 
ctanagttaa conagotagt tnSSaaaa ata^J^^^^^ acngccctgg ggccnccgta 600 
agaaaaagnc i^ncatgaaaa atgatgggct ccncctcaat gggaaagcca 660 

670 

<210> 29 

<211> 551 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

:s 11% 



<400> 29 



s:Sc\i: ™s -™ a™^^ ,,,,3 ,,^^3^^^^^ 

ccotctccag ccactgatgg gaaagtat^? tccatcf^^^ ctgtagaaaa ggtttcttct 120 
tcagtaccag aggtgcctga ?gt?gcacat ttacc^.i'^ ^^^^^^^ca gcaagaatct 180 
cctcttgact taagLgtgg ScagaaS^ aclatt ^ agaagctggg accctgtctc 240 
cgtaatgaat gtcccaggg? aSaaaaa aat^on tagcctcaga ttcctcttac 300 
aaagcaatag ctgatgggJa gaggagctS IgcaSSa ^^^"^^^ tccttcttcc 360 
aaaagtgaaa ttgggaagac aafag^tcS cag^mog ?aaaaf ' ""^^^^aaaa 420 
aggaaggaag agagaagaga gacnaagatc ncLc^S^ ^JO 
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aaaaaanaaa a 551 

<210> 30 

<211> 684 

<212> DMA 

<213> Homo sapiens 

<220> 

•<221> misc_feature 

<222> 545, 570, 606, 657, 684 

<223> n = A,T,C or G 

<400> 30 

actagttcta tctggaaaaa gcccgggttg gaagaagctg tggagagtgc gtgtgcaatg 60 

cgagactcat ttcttggaag catccctggc aaaaatgcag ctgagtacaa ggttatcact 120 

gtgatagaac ctggactgct ttttgagata atagagatgc tgcagtctga agagacttcc 180 

agcacctctc agttgaatga attaatgatg gcttctgagt caactttact ggctcaggaa 240 

ccacgagaga tgactgcaga tgtaatcgag cttaaaggga aattcctcat caacttagaa 300 

ggtggtgata ttcgtgaaga gtcttcctat aaagtaattg tcatgccgac tacgaaagaa 360 

aaatgccccc gttgttggaa gtatacagcg ggagtcttca gatacactgt gtcctcgatg 420 

tgcagaagtt gtcagtggga aaatagtatt aacagctcac tcgagcaaga accctcctga 480 

cagtactggg ctagaagttt ggatggatta tttacaatat aggaaagaaa gccaagaatt 540 

aggtnatgag tggatgagta aatggtggan gatggggaat tcaaatcaga attatggaag 600 

aagttnttcc tgttactata gaaaggaatt atgtttattt acatgcagaa aatatanatg 660 
tgtggtgtgt accgtggatg gaan 684 

<210> 31 

<211> 654 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_featiire 
<222> 326, 582, 651 
<223> n = A,T,C or G- 

<400> 31 

gcgcagaaaa ggaaccaata tttcagaaac aagcttaata ggaacagctg cctgtacatc 60 
aacatcttct cagaatgacc cagaagttat catcgtggga gctggcgtgc ttggctctgc 120 
tttggcagct gtgctttcca gagatggaag aaaggtgaca gtcattgaga gagacttaaa 180 
agagcctgac agaatagttg gagaattcct gcagccgggt ggttatcatg ttctcaaaga 240 
ccttggtctt ggagatacag tggaaggtct tgatgcccag gttgtaaatg gttacatgat 300 
tcatgatcag ggaaagcaaa tcagangttc agattcctta ccctctgtca gaaaacaatc 360 
aagtgcagag tggaagagct ttccatcacg gaagattcat catgagtctc cggaaagcag 420 
ctatggcaga gcccaatgca aagtttattg aaggtgttgt gttacagtta ttagaggaag 480 
atgatgttgt gatgggagtt cagtacaagg ataaagagac tgggagatat caaggaactc 540 
catgctccac tgactgttgt tgcagatggg cttttctcca anttcaggaa aagcctggtc 600 
tcaataaagt ttctgtatca ctcatttggt tggcttctta tgaagaatgc nccc 654 

<210> 32 

<211> 673 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 376, 545, 627 
<223> n = A,T,C or G 
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<400> 32 



actagtgaag aaaaagaaat tctaatacarr 

tatcacctga caccaggagt tttcattgga S^^aaf^^^ otcttcaaaa catcattctt 60 
ttaaagacca cacaaggaag caaaatcttt c^^ff ^ aacctggtgt tactaacatt 120 
aatgaattga aatcaaaagi atcJgacatc aSf ^^^^ taaatgatac acttctggtg 180 
gataaactcc tctatccagc agaSoacc? aSaf atggtgtaat tcatgttgte 240 
aataaattaa tcaaatacat cL^SSg ?SSeata ^^^^^^^'^^ ggaaata^t 300 
cccgtgactg tctatnagco aattattaaJ ^n!? gtagcacctt caaagaaatc 360 

tgtgggaaat aactgaaaaa gSgaccJaoa ao^^^*^"!^ aaatcattga tggglgtgcc 420 
atacctagga tttctactgg agSgaJK SgSaSc f J"^^^^*^ ctgaaatLa 480 
aagangtccc aaggtcacca aattcattga aS?aa?a^^ tctgaagaaa ttgttacaag 540 
gaaattaaaa gacgcttcag ggagacnSc dttfj. ggtctttatt tgaagatgaa 600 
cagggattag aaa ^ gacnccc catgaaggaa ttgccagcca caaaaaaatt 660 

<210> 33 

<2H> 673 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

i^i "1- «o, «3, 

<400> 33 

ggatrtgttg S?ct^??gg gScaS^c Itca^^T^ actgagagcc taagcatact 60 
gaaggttgaa aggagcaggg aaaagatcca gaSS^S '^T^^^^ aaattataaa 120 
tottgaagta tgatgcatat tgcattaS ?at?^^!" agttcgacat catcatcttt 180 
atcatttaga agggcaagtt clagaggata ^qa^aat^^ ^^^^^aattg cagtctgagg 240 
tgactaaaaa tgaacattaa tgttnSga? Saa^J^^? agaacttttt aactattcat 300 
tgaaattatg caactttgat a?cata??cc ^^aa?tf !^=<=tgctgg cagtcccaaa 360 
gaaactttat aaagcatatg gtcagJtaJt JnK^aL^^ ttgggctttt gtgattgant 420 
ctgcacttaa agaagtctaa cagtlcaaat SJi^ f f ^fgcaaaacct gaaccacctt 480 
tntattttta aatattgtac ta?t?a?a;t naSf cttagatgga tntatttntt 540 
aatttatcat ttcaanggca ttcta?t?gg gtfttlll^ ttcttactaa tacacaaatn 600 
ttcgctactg tnt gtttagaagt tgattccaag nantgcatat 660 



<210> 34 

<211> 684 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> inisc_feature 

«3. 523, 5,4. 575, 5^, 

<223> n = A,T,C or G 
<400> 34 

actagtttat tcaagaaaag aacttactga ttoctctoti- n ^ 

tgatcagggc tggtgtagoa tccggttcct ttaSaof^ cctaaagcaa gagtggcagg 60 
gaccaaggag gaaatcacta agacatttaa ntt^^^^" ^aactgcatt tgtcactgat 120 
ccacagttct gagcottaac cJtgSgtt? SacaaL^ tatgaacgtt cttggaca% 180 
ttcaggagga atctgtgcgg atagat^ggc ?ggaS?t?^ acgagctcca cctccccttc 240 
?al^'''5!* ^tggctgggt atggagc^ga cagS^^a ^^^^^ttctg ggttgcaagt 300 
tgcctggttg gaaggtacag gtgttcagS crttSgSS ItlT^T'' *°^9°'=c:ggc 360 

ccctcggaaa aagggcataa agtngtgggg 420 
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gacaattctc agtccaagaa gaatgcattg accattgctg gctatttgct tncctagtan 480 
gaattggatn catttttgac cangatnntt ctnctatgct ttnttgcaat gaaatcaaat 540 
cccgcattat ctacaagtgg tatgaagtcc tgcnnccccc agagaggctg ttcaggcnat 600 
gtcttccaag ggcagggtgg gttacaccat tttacctccc ctctcccccc agattatgna 660 
cncagaagga atttntttcc tccc 684 

<210> 35 
<211> 614 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis Cofeature 

<222> 17, 20, 152, 223, 267, 287, 304, 306, 316, 319, 321, 355, 
365, 382, 391, 407, 419, 428, 434, 464, 467, 477, 480, 495, 
499, 505, 515, 516, 522, 524, 527, 542, 547, 549, 567, 572, 
576, 578 

<223> n = A,T,C or G 



<400> 35 

actagtccaa 

ggtaagatcg 

tcactgcatg 

cacacctcgc 

acggtttctc 

aagncncgtg 

ttccngtttc 

gaaggganga 

tgctttatgt 

gntcgancnc 

aaaaaaaaaa, 



cgcgttngcn 
agcaatggct 
aagactggct 
tccctgttag 
tgtggtcaat 
agcagncanc 
tcctggccct 
taantgggat 
ggganacana 
gtcttcgatt 
aaaa 



aatattcccc 
tcaggacatg 
tgtctcagtg 
tgccgtatga 
gttggtnggc 
nccagttctg 
gngtgggcta 
ctaccaattg 
tctanctctc 
ttcgganaca 



tggtagccta 
ggttctcttc 
tntcaacctc 
cagcccccat 
tgattggtgg 
caccagcagc 
nggcctgatt 
attctggcaa 
atttnntgct 
cnccantnaa 



cttccttacc 
tcctgtgatc 
accagggctg 
canatgacct 
aaagtanggt 
gcctccgtcc 
cgggaanatg 
aacnatntct 
gnanatnaca 
tactggcgtt 



cccgaatatt 60 
attcaagtgc 120 
tctcttggtc 180 
tggccaagtc 240 
ggaccaaagg 300 
tactngggtg 360 
cctttgcang 420 
aagattnttn 480 
ccctactcgt 540 
ctgttgttaa 600 
614 



<210> 36 

<211> 686 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_featiire 

<222> 222, 224, 237, 264, 285, 548, 551, 628, 643, 645, 665, 674 
<223> n = A,T,C or G 



<400> 36 

gtggctggcc 

ctccctcgtc 

taacctcggt 

gggcgggggc 

ctcagctcgc 

acctgctctg 

gcftatttctt 

gagactggat 

ggatgccacc 

ctcagganat 

ggatattatt 

aactnaaaca 



cggttctccg 
gactgttgct 
gccaccggat 
ctggagcagc 
cagtccggtc 
ggcacacgcg 
aatcagcgct 
tggaacattt 
aaagcagcta 
naattgataa 
atttgtttac 
aaanctaagg 



cttctcccca 
tgctggtcgc 
tgcccttctt 
ccgaggcact 
gctngcttcc 
acccgtggtt 
tgcaaagatg 
ttggggtcta 
cagcagctgc 
cctggctcat 
cggggganag 
aaatcc 



tcccctactt 

agactccctg 

ttcctgttgc 

gcagcagaag 

cgccgcatgg 

gatttggcct 

gttaacctat 

aaggtctgtt 

agatttcaca 

aacacattgt ■ 

gataactgtt 



tcctccctcc 
acccctccct 
ccagcccagc 
ananaaaaga 
caatnagaca 
tcagtggcat 
gctacgccag 
tggggtgcaa 
gcccaagtgt 
caagaatgtg 
tcncntattt 



ctccctttcc 60 
cacccctccc 120 
cctagtgtca 180 
cacgacnaac 240 
gacgccgctc 300 
cacccttatg 360 
ggagatacag 420 
cactgaataa 480 
gggatgctgt 540 
gatttcccca 600 
taattgaaca 660 
686 



<210> 37 
<211> 681 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> iriisc_feature 



<221> raise feature 

628, 634, 641, 645, 658, 670 ' ^l^' 

<223> n = A,T,C or G 

<400> 37 

ScL^cf S2r ^^^^-^^^^^ «o 

cancctgnat caatctganc tolTtcT, gcSScS aStSSS^ --ngactc 120 
aaaggtcgca cnnncagaga agctgctqcc ancao^»n^ acctcggagg tggangccgn 180 
nataggaaac tggtgaccL gctgS? tS^^f gccccnnccc tgncgggctn 240 
cacactgagt tnLgatgan gS?^^^ JSc^^o^ gcacgcgang ggcacnnnct 300 
tgcggaggaa ggaagacloc SSggSJ SagSacn T."'''^^^^^ annacnggac 360 
gattatnccc cttgactgag tctctalaaa tgccaccccc ccacccctag 420 

natnntgctc natcgggact aacana^?^^ gctacccgaa cccgcctcca ttccctacci 480 
tnanacLac ^cnacnga^ naJnSoaS '"^^^^^ ggctatcccc cancatcccc 540 

cggcgcnggc ct'tcggS gtISc a^^SS 'rlllZlTat 
ggactcctcn ttgttccctc c ^^cnaatrcc naaanggcgg gccccccngt 



<210> 38 

<211> 687 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



660 
681 



655, 674, 682 ' ' 626, 639, 

<223> n = A,T,C or G 



<400> 38 



c^^c^cggc:: gtS^g^a'^ ZttlZTo ^^ISJLT^ ^^^^^'^^^^ '^-^^^^^ 
gagggcggga cntgccgggg ccggagctca Saalo^^^ ^^^gctcccgc aagcggagga 120 
atcgcaaggg cggcgc^aac ctnaggcS SSSaaaa f ^^^^^^^^ ctctcccgcc 180 
gggggctgtg anaaccgcaa aaanaacact tttt^T^^ tccccnangc ggnggcggcg 240 
aaggananac ttccacagan gSaacrt?S ^^^f "=^^"9 cgaacccgtc cacccccgcg 300 
gcaccccagt aagttcc^gn cgg^galgct oaTcTclTc L'"'" -^agggtgat 360 
cggcgcacna aggggangan caccgctgtc aaaaaanctc ttcgctccac 420 

gcccgcccta ntltlttttt Sgaatctcc actt?^'?^" acaggtcatc tgatcacgtc 480 
ctccttgcgc cttcctctna S?tSnaac atlrif ^ accccaccog ccgttctctc 540 
gciicnngtng aaattaattc ggJccn™ aa?2nS° ^f^^^^*'^^ tanttnctct 600 
aactgctgtt ctgnttactg ?ng?ccc otgtggcaac tgctnaaaga 660 
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<210> 39 
<211> 695 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 300, 401, 423, 429, 431, 437, 443, 448, 454, 466, 492, 515, 
523, 524, 536, 538, 541, 552, 561, 566, 581, 583, 619, 635, 
636, 641, 649, 661, 694 
<223> n = A,T,C or G 

<400> 39 

actagtctgg cctacaatag 
tagaaaaacg tatacagatt 
tgacccctgc gctagactgt 
cttattagtt ataacatgat 
ccaaactttt ttttttaact 
gttgttatgg gtagaaaaaa 
ttagtttaaa attaggggta 
aanaaatcna ngaacangat 
atttgaaatc anacacggca 
ntccaaatct tnttggaaac 
tggaatggcc ctatttaang 
naatatatat ccttggtccc 

<210> 40 
<211> 674 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 403, 428, 432, 507, 530, 543, 580, 583, 591, 604, 608, 621, 
624, 626, 639, 672 
<223> n =.A,T,C or G 

<400> 40 

actagtagtc agttgggagt ggttgctata ccttgacttc atttatatga atttccactt 60 
tattaaataa tagaaaagaa aatcccggtg cttgcagtag agttatagga cattctatgc 120 
ttacagaaaa tatagccatg attgaaatca aatagtaaag gctgttctgg ctttttatct 180 
tcttagctca tcttaaataa gtagtacact tgggatgcag tgcgtctgaa gtgctaatca 240 
gttgtaacaa tagcacaaat cgaacttagg atgtgtttct tctcttctgt gtttcgattt 300 
tgatcaattc tttaattttg ggaacctata atacagtttt cctattcttg gagataaaaa 360 
ttaaatggat cactgatatt taagtcattc tgcttctcat ctnaatattc catattctgt 420 
attagganaa antacctccc agcacagccc cctctcaaac cccacccaaa accaagcatt 480 
tggaatgagt ctcctttatt tccgaantgt ggatggtata acccatatcn ctccaatttc 540 
tgnttgggtt gggtattaat ttgaactgtg catgaaaagn ggnaatcttt nctttgggtc 600 
aaantttncc ggttaatttg nctngncaaa tccaatttnc tttaagggtg tctttataaa 660 
atttgctatt cngg 674 

<210> 41 

<211> 657 

<212> DNA ■ 

<213> Homo sapiens 

<220> 

<221> misc feature 



tgtgattcat gtaggacttc tttcatcaat tcaaaacccc 60 
atataagtag ggataagatt tctaacattt ctgggctctc 120 
ggaaagggag tattattata gtatacaaca c±gctgttgc 180 
aggtgctgaa ttgtgattca caatttaaaa acactgtaat 240 
gtagatcatg catgtgaatg ttaatgttaa tttgttcaan 300 
ccacatgcct taaaatttta aaaagcaggg cccaaactta 360 
tgtttccagt ttgttattaa ntggttatag ctctgtttag 420 
ttngaaantt aagntgacat tatttnccag tgacttgtta 480 
ccttccgttt tggtnctatt ggnntttgaa tccaancngg 540 
ngtccnttta acttttttac nanatcttat ttttttattt 600 
ttaaaagggg ggggnnccac naccattcnt gaataaaact 660 
ccaaaattta aggng 695 
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50iM3f'5ll''Jf?'e?5f' "2, 

<223> n = A,T,C or G 



<400> 41 



^,'^2-0. .„ . 

accttgggac cctaatgggg caoaaaatat aar^^ tgccttctaa aggctgacat 120 
cctttgggag gctgaagJI aSgSg titgtltttt ^f^V^^^^^^ atgaccactc 180 
atnggtnaca ngatgttaaa ntaaqqntc? antS^o^^ ctcatggaag cagcacatga 240 
acacactcct anca^tggt aSaggStgc iTallalZt ^^'^^^^ tgaaaaantg 300 
agcatgggct gatctgatL cttcSggca ^cccgctS ll^T'^'' ^aaaaacatt 360 
naaggatggg ananttttcc atatccttgc tattaa^n^^ ttttatggga agtcttatta 420 
ccctctatta aaaatcactg nccttaSc aoS^^ ! f^^gaacact ctctaaattt 480 
tttctctgac ttagttct^g gcalgggSc SccSa!^ ^^^-^^9^00 540 

ttctccngaa ctcacctac? ?gaa???S Z^Tolll t^ga^lS^ a1a~ 
<210> 42' 
<211> 389 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc feature 
<222> 179,^317, 320 
<223> n = A, T, C^or G 

<400> 42 

'c^nTJ. SSSL^ i-tillUi llllT'''' r^'^^^-^ -^^"^"^ ^0 

caggaagaaa acaaaaaJca gactg?Sc cacaScir aattagaaga 120 

ggccttcacc gccaccaggg £gtcc3S aa^!^^ ^ aacctccgtt gtggcagang 180 
atcctgaaga attcctg??? ggJjgSgS aaoSfaat^ gactccagcc ttctgaggcc 240 
tgttgcctgc ccgcgtngtn ggSaggSc taaSt.^^ acccggattt aaaaagatgc 300 
atattttaag ttaagaaaaa SaSS ^^^'^^^^^^ gtgaatttct taaaagaaaa 360 

389 

<210> 43 
<211> 279 
<212> Mia 

<213> Homo sapiens 

<400> 43 

gSaig^:?: iiziiit gisst titiT^'t «o 

tactgtgtta gctctttgia igU^tS^ ^^^^^ ^5 tctagaactt gcatgacct? 120 
tgtccttatc attgtataaa agc^JSg tlc^lTatl acaaataata 180 

aataaaatac ttaaacactg aaaaSaaaf aaaaaSaa" ^^^^^^^^^^ tgtctgattt 240 

279 

<210> 44 
<211> 449 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

S23': 2^3, 281, 323, 325, 337, 393 
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<400> 44 

actagtagca tcttttctac aacgttaaaa 
caacaacaac aataacaata aatcctaagt 
atatcagcct gttttttccc ttttttctcc 
tctacagcct ctttcctctt ctcatgcttg 
aagantgggc tgtttngctt ggantncggt 
gttggaagaa actcaaacct tcnancccta 
atttttgtac tggcattaac aaaaaaagaa 
aactttaaaa gggaaaaaaa aaaaaaaaa 

<210> 45 

<211> 559 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 263 

<223> n - A,T,C or G 



ttgcagaagt agcttatcat taaaaaacaa 60 
gtaaatcagt tattctaccc cctaccaagg 120 
tgggaataat tgtgggcttc ttcccaaatt 180 
agcttccctg tttgcacgca tgcgttgtgc 240 
ccnagtggaa ncatgctttc ccttgttact 300 
ggtgttncca ttttgtcaag tcatcactgt 360 
atnaaatatt gttccattaa actttaataa 420 

449 



<400> 45 

actagtgtgg gggaatcacg gacacttaaa gtcaatctgc gaaataattc ttttattaca 60 
cactcactga agtttttgag tcccagagag ccattctatg tcaaacattc caagtactct 120 
ttgagagccc agcattacat caacatgccc gtgcagttca aaccgaagtc cgcaggcaaa 180 
tttgaagctt tgcttgtcat tcaaacagat gaaggcaaga gtattgctat tcgactaatt 240 
ggtgaagctc ttggaaaaaa ttnactagaa tactttttgt gttaagttaa ttacataagt 300 
tgtattttgt taactttatc tttctacact acaattatgc ttttgtatat atattttgta 360 
tgatggatat ctataattgt agattttgtt tttacaagct aatactgaag actcgactga 420 
aatattatgt atctagccca tagtattgta cttaactttt acagggtgaa aaaaaaattc 480 
tgtgtttgca ttgattatga tattctgaat aaatatggga atatatttta atgtgggtaa 540 
aaaaaaaaaa aaaaaggaa 559 

<210> 46 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 270, 467, 477, 502, 635, 660, 671, 688, 695, 697, 725 
<223> n « A,T,C or G 



<400> 46 

actagttcta gtaccatggc tgtcatagat 
tcaggttccc taacaattgt ttgaaactga 
actgtcatgt atatggtgta tatgggatgt 
tatacatatg catatatatg tataatatac 
catatatata cacatatatg cacacatatn 
ggggcaattg tattctctcc ctctgtctgc 
cttgatttcc tttggataag agtcttatct 
gatttctatt ccagaatacc tctcatatct 
ataagattgt agtatgaaag antttgctta 
ctacaaatta aattgtaaaa tgatggtttg 
atgtaactgg gtacctgtta tatcaaagaa 
atccttatat ngccctctct gacctgantt 
taggnttggg c 



gcaaccatta tattccattt agtttcttcc 60 
atatatatgt ttatgtatgt gtgtgtgttc 120 
gtgcagtttt cagttatata tatattcata 180 
atatatacat gcatacactt gtataatata 240 
atcactgagt tccaaagtga gtctttattt 300 
tcactgggcc tttgcaagac atagcaattg 360 
tcggcactct tgactctagc cttaacttta 420 
atcttaaaac ctaaganggg taaagangtc 4 80 
gttaaattat atctcaggaa actcattcat 540 
ttgtatctga aaaaatgttt agaacaagaa 600 
cctcnattta ttaagtctcc tcatagccan 660 
aatananact tgaataatga atagttaatt 720 

731 



<210> 47 
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<211> 640 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 



33S. 351. 365, )28 441 ul' ff!' 337, 

"^3>=r: s iii- 



<400> 47 



cSS IT^'^l :^"2t-S '^^""^^^^ ^^"cccgat 60 

gtacaccaaa tgtgaciS ^tScca:? a?n'SSct ^'J"^*"^ ttgcctaaS llo 
anacgactnc aacaattttt tgatnaccS aaana^^^ ^^^^^^^^^ tcntcnatgg 180 
ggagcagcat ggacctgtcn gcnactaan^ tit ggctnnaana agtaoantct 240 

ttggtatgtc ttactgaaag InT.TalTat gcttJS^ "^^^^^-^t tacacaacct 300 
oaganattgc caatgccaag tccgagSggt JaStS^^? ctagaccacg aggncaaccg 360 
tacatacntt gtccccgaaa nanIaga?S oSaa^^S aatacattcc atggatgcat 420 
acanctacac ctggtgcttg ganaacanac Sf^^' tcttcanact ggtccngaaa 480 
cccagtgggt tttnccttgg L^ott aSanS fj^^^^^^^fc acaagttccc 540 
ntggcnttnt nttgggacca ntcttcJcac aaS^aSc attctttgcc 600 



<210> 48 
<211> 257 
<212> DMA 
<213> Homo sapiens 

<400> 48 



tgattttctt tgttcctgL aaaSaSttJ ^.'f gcataatcac attttcLaa 120 

ttatatttgt atatgtaLa tcSf^a^^ tJt^StaL '"ff^^^^ ttttggaaga 180 
aaaaaaaaaa aaaaaaa ^^^^ta tttaaataaa aagtatcttt agagtgaaaa 24 0 

257 



<210> 49 
<211> 652 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

^^6. 571, 647 
<223> n = A,T,C or G 

<400> 49 



t:l:ill:it as iT^^lllll -^ttcatta taacccaatt 60 

gttgacacta gaaactgccc a?ttSg?2 tloTa^JT. "f^^^"^ gcacagttaa 120 
tggggaaaaa aatcttattt taaaatggSt tagaaStt !f ^^gaaac attggaaaga ISO 
taaacttott tctgtttcca aaacttqiaa atf^.f! ^ tcagattact ttgaaaattc 240 
ttttcaaagc tttcctcaca tttttlf^gt gtga?tt?cc ISllTr ^"^^'--^g 300 
ttotttaaag cagctatatc ccaacccata alltlll:^ ttttaatata catatttatt 360 

.«,c„„o 
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cgcataactg cacaaatgaa cagtgtatac ctcttggttg tgcattnacc cc * 652 

<210> 50 
<211> 650 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^f eature 

<222> 237, 270, 311, 443, 454, 488, 520, 535, 539, 556, 567, 594, 
603, 634 

<223> n =^ A, T,C or G 



<400> 50 

ttgcgctttg atttttttag ggcttgtgcc ctgtttcact tatagggtct agaatgcttg 60 
tgttgagtaa aaaggagatg cccaatattc aaagctgcta aatgttctct ttgccataaa 120 
gactccgtgt aactgtgtga acacttggga tttttctcct ctgtcccgag gtcgtcgtct 180 
gctttctttt ttgggttctt tctagaagat tgagaaatgc atatgacagg ctgagancac 24 0 
ctccccaaac acacaagctc tcagccacan gcagcttctc cacagcccca gcttcgcaca 300 
ggctcctgga nggctgcctg ggggaggcag acatgggagt gccaaggtgg ccagatggtt 360 
ccaggactac aatgtcttta tttttaactg tttgccactg ctgccctcac ccctgcccgg 420 
ctctggagta ccgtctgccc canacaagtg ggantgaaat gggggtgggg gggaacactg 480 
attcccantt agggggtgcc taactgaaca gtagggatan aaggtgtgaa cctgngaant 540 
gcttttataa attatnttcc ttgttanatt tattttttaa tttaatctct gttnaactgc 600 
ccngggaaaa ggggaaaaaa aaaaaaaaat tctntttaaa cacatgaaca 650 

<210> 51 

<211> 545 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 66, 159, 195, 205, 214, 243, 278, 298, 306, 337, 366, 375, 
382, 405, 446, 477, 492, 495, 503, 507, 508, 521, 537 
<223> n = A,T,C or G 



<400> 51 

tggcgtgcaa ccagggtagc tgaagtttgg gtctgggact ggagattggc cattaggcct 60 
cctganattc cagctccctt ccaccaagcc cagtcttgct acgtggcaca gggcaaacct 120 
gactcccttt gggcctcagt ttcccctccc cttcatgana tgaaaagaat actacttttt 180 
cttgttggtc taacnttgct ggacncaaag tgtngtcatt attgttgtat tgggtgatgt 240 
gtncaaaact gcagaagctc actgcctatg agaggaanta agagagatag tggatganag 300 
ggacanaagg agtcattatt tggtatagat ccacccntcc caacctttct ctcctcagtc 360 
cctgcncctc atgtntctgg tntggtgagt cctttgtgcc accanccatc atgctttgca 420 
ttgctgccat cctgggaagg gggtgnatcg tctcacaact tgttgtcatc gtttganatg 480 
catgctttct tnatnaaaca aanaaannaa tgtttgacag ngtttaaaat aaaaaanaaa 540 
caaaa 545 

<210> 52 

<211> 678 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 98, 119, 121, 131, 136, 139, 140, 142, 143, 163, 168, 172, 
176, 184, 189, 190, 191, 200, 201, 205, 207, 221, 223, 229, 
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<223> n « A,T,C or G 
<221> misc_feature 



<221> misc_feature 

<1l3>'f : lf^;/2',"^- «°. "S- 

<400> 52 

iSi Eii iiS iiir? M So 

tggnccccnn nccctoJcS^ S^tnS^^ ^^^^^'f"^ tcncattnga anttantccc 180 
ncttccccat ctccntcccc S^SS^rt! ^T'"'''"^'' "*='^<^'=tccnn ctttttntan 240 
nctccncncc tccnScgtt cScntn^^" ""^^^^'^'^^ ncacttnctc 300 

annctctccc cnctgcaanc St^^;J^! cnacntntnc ncnnntnccn tgccnntnaa 360 
ncncgctcct ntTc^^l ScSScn t^T^"" ctntccacto cntnottctc 420 
cgnntcnttn nnntcS^ ^nccc^cc tS^ff'"'''' cantacnctc nccncccttn 480 
tctctcccnc nncncnncS? ScccnJcc nngSnc^t tt1?'°'°° «cggtntntc 540 
ccttcntcnc cantccatcn ™i-n^^n^!^=r ""^''^"^""t ttccgccccn cnconccntt 600 
ntctctttca ^cngtcc nctncctncc nctcacnocc gctncccccn 660 

678 

<210> 53 

<211> 502 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> niscfeature 

mf^elf'jr' 386, 420, 452, 457, 461, 

<223> n = A,T,C or G 

<400> 53 

c!:Sg?ac IttlllZ llllT'^l ^0 

tgacctgggg cggaaaaang ^aaaaS tgaScca 120 
agatcaatat gagcagctgt cctctaaaon lllll I ctttgtggcc acatggtgtc 180 
gtacatggta aaaagL^ gcnaaS tt=c^^=? gcccgaattt gtgccaataa 24 0 
cacgtcatcc gcatLaSI gS^^gSc tStJc^aaaa ^^^^^^^ ccaccccttc 300 
atgcgaagtg cctttggaaa IccSnSS ^llTalll^ ctgacaggct cccaacaggo 360 
atcatgttca tccacaccaa ctgtggccag ggttcacatt gggccaattn 420 

gncaanttca aaS^^cgg f^''^^^^'^ angaacntgt naattnaagc cctgocoagg 48? 

502 

<210> 54 

<211> 494 

<212> DNA 

<213> Homo sapiens 



wo 02/00174 



PCTAJSOl/21065 



23 



<220> 

<221> ndsc^feature 
<222> 431, 442, 445 
<223> n = A,T,C or G 



<400> 54 

actagtccaa gaaaaatatg cttaatgtat attacaaagg ctttgtatat gttaacctgt 60 
tttaatgcca aaagtttgct ttgtccacaa tttccttaag acctcttcag aaagggattt 120 
gtttgcctta atgaatactg ttgggaaaaa acacagtata atgagtgaaa agggcagaag 180 
caagaaattt ctacatctta gcgactccaa gaagaatgag tatccacatt tagatggcac 240 
attatgagga ctttaatctt tccttaaaca caataatgtt ttcttttttc ttttattcac 300 
atgatttcta agtatatttt tcatgcagga cagtttttca accttgatgt acagtgactg 360 
tgttaaattt ttctttcagt ggcaacctct ataatcttta aaatatggtg agcatcttgt 420 
ctgttttgaa ngggatatga cnatnaatct atcagatggg aaatcctgtt tccaagttag 480 
aaaaaaaaaa aaaa 494 

<210> 55 

<211> 606 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 375, 395, 511, 542, 559, 569, 578, 581 
<223> n =* A,T,C or G 



<400> 55 

actagtaaaa agcagcattg ccaaataatc 
gatgttaagc tttttgaaaa gtttaggtta 
tgcttccctt tatctggaat gtggcattag 
ttcaattcca tgacttaagg ttggagagct 
cagttttgca taattataat cggcattgta 
atctgcactt tctaaatatc aaaaaaggga 
tgtttgaaac atgantttta tttgcttaat 
ttgggatcct gtgtaaaact gttctcatta 
actagctaca aattccgttt catattctac 
anatggtcta cttctgtcnt ataaaaacna 
aaaaaa 



cctaattttc cactaaaaat ataatgaaat 60 
aacctactgt tgttagatta atgtatttgt 120 

cttttttatt ttaaccctct ttaattctta 180 
aaacactggg atttttggat aacagactga 240 
catagaaagg atatggctac cttttgttaa 300 
aatgaagtat aaatcaattt ttgtataatc 360 
attanggctt tgcccttttc tgttagtctc 420 
aacaccaaac agttaagtcc attctctggt 480 
ntaacaattt aaattaactg aaatatttct 540 
aacttgantt nccaaaaaaa aaaaaaaaaa 600 

606 



<210> 56 

<211> 183 

<212> DNA 

<213> Homo sapiens 



<400> 56 

actagtatat ttaaacttac aggcttattt 
aattaacatg gttataatac gtacaatcct 
gtgtgataaa ctgattttgg tttgcaataa 
aaa 

<210> 57 

<211> 622 

<212> DNA 

<213> Homo sapiens 



gtaatgtaaa ccaccatttt aatgtactgt 60 
tccctcatcc catcacacaa ctttttttgt 120 
aaccttgaaa aataaaaaaa aaaaaaaaaa 180 

183 



<220> 

<221> misc_feature 

<222> 358, 368, 412, 414, 425, 430, 453, 455, 469, 475, 495, 499, 
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529, 540, 564, 575, 590 
<223> n = A,T,C or G 

<400> 57 



SSSga TXZl gttlcSS L^ccT'^^ "-^^^^^^ ^0 

aatcagtgag cactgttc?g 2?agagSc SSSo' "^^^^^^^^^ gaaagaggat 120 
ctgggtcaaa gctgcatgal accaggccct aacaan^f ^^^^<=c=ta ggatccagga 180 
agagaacctg acttctc?tt ccc???cc2 cctccaSSt tfl^*^'^ t^g-ggtggg 240 
agggatcttc tgagcttgtt tccctactaa tactggaact ctatcctgtt 300 

tctacaanaa gca^ccct'tc tttgSctc? SgS^ ll^T^'t ^^^^^-^^ 360 
gaganaccan aagcctctga tttttaat^^ lltr agcttgacct ananttcatg 420 

atatatattt ct?tnaatn? SgagJcJS gSSS? "^T^'"' atatntaca? 480 
gaaacctgaa ttaaaaccat gal^aaaat aSfnS^ aaaatccant ccctctgccn 540 
aaacttgaaa aaaaaaaaaa J™^^^^^ gtttncctta aagatgttan taattaattg 600 

622 

<210> 58 

<211> 433 

<212> DNA 

<213> Homo sapiens 



<400> 58 



g"S;:S X?a^S lT.t^itiTt Tainr'^ :r^"^- «o 

tcctttcagc tgccagtgtt gaJS^aS f^ll ttgctcatat agtaaaggga 120 

. accagcttta agctgaaSa ?tt^atga^t laalttT^ tgatgttatc tgtgacagtc 180 
catatttgtg actt?aatcg tgcSc^tJg aSaJaSt Sf""^'^ ^t^ctgaaaa 240 
tgacagtaaa cctgtccatt a?gaa?gg?? ?ac?St!^^ ttttactggt tcttctgaat 300 
ttatccacca aagacttcat tt?tata?ca J^! f ^ ttatttgttt tgacttgaat 360 
aaaaaaaaaa aaa '^tgtgtatca tcaataaagt tgtatgtttc aactgaaaaa 420 

433 

<210> 59 
<211> 649 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_feature 

<m> i^i:c'^l:"°- »°. «3, «4 

<400> 59 

actagttatt atctgacttt cnggttataa tcattctaa^ 

tgtcatttgg atttgcattt ctctgatgag taa^aS^^^ gagtgtgaag tagcctctgg 60 
ttggccatat gtgtatgttc cctggagSg ^gSSaS f " gctggtgctg 120 

attaggcgtn tgtcttttta ttacigjgt? g?aaa2?S tttTl'^^^ ccacttttta 180 
gacccttatc agatacatgg tttgcaJSa ?t??S^» f^att ctggattcta 240 

ctttatcgat aatgtcctta gacata^aa? aaatS^=^ S'^S^St tgtgttttca 300 
ggctgtgcaa ggtgggctca cgcttgtaat cccaJ^fcS "taaaagtg acttgatttg 360 
ataatatgan gangctagga g?tcgagg?c Zallaatl ^^^f ^^^^g aggtgggtgg 420 
tacnaaaaat aoaaaaatta gtcaggjatg g?SSa™ f^^.t^''^^ aacttgtctc 480 



tacnaaaaat acaaaaatta gtcagaclta atn^l^^''"^ gcatagcgaa aacttgtctc 480 
ggangctgan gcacaaggat Lct??aacc SS^f! tctgtaatac cagcttctca 540 
atcatgccag ggcaacaaaa atgagLott gt't^S^: lllt^elZ' '^^'^^^^^^^ 

<210> 60 
<211> 423 
<212> DNA 
<213> Homo sapiens 



649 
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<220> 

<221> misc^feature 

<222> 209, 222, 277, 389, 398 

<223> n = A,T,C or G 

<400> 60 

actagttcag gccttccagt tcactgacaa 
acctggcagt gataccatca agcctgatgt 
gaagtgagcg ctgggctgtt ttagtgccag 
tcttctgtat tttttttttc cattagtana 
tgtcttacaa. ggcagggctt tcctacaggg 
aggaatggcc tgagttggcg ttgtgggcag 
caacccatta atcttttgta gtttgtatna 
aaa 

<210> 61 

<211> 423 

<212> DNA 

<213> Homo sapiens 



acatggggaa gtgtgcccag ctggctggaa 60 
ccaaaagagc aaagaatatt tctccaagca 120 
gctgcggtgg gcagccatga gaacaaaacc 180 
acacaagact cngattcagc cgaattgtgg 240 
ggtgganaaa acagcctttc ttcctttggt 300 
gctactggtt tgtatgatgt attagtagag 360 
aacttganct gagaccttaa acaaaaaaaa 420 

423 



<220> 

<221> misc_feature 

<222> 195, 285, 295, 329, 335, 340, 347, 367, 382, 383, 391, 396, 
418 

<223> n A,T,C or G 
<400> 61 

cgggactgga atgtaaagtg aagttcggag ctctgagcac gggctcttcc cgccgggtcc 60 
tccctcccca gaccccagag ggagaggccc accccgccca gccccgcccc agcccctgct 120 
caggtctgag tatggctggg agtcgggggc cacaggcctc tagctgtgct gctcaagaag 180 
actggatcag ggtanctaca agtggccggg ccttgccttt gggattctac cctgttccta 240 
atttggtgtt ggggtgcggg gtccctggcc cccttttcca cactncctcc ctccngacag 300 
caacctccct tggggcaatt gggcctggnt ctccncccgn tgttgcnacc ctttgttggt 360 
ttaaggnctt taaaaatgtt annttttccc ntgccngggt taaaaaagga aaaaactnaa 420 
aaa 423 

<210> 62 

<211> 683 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 218, 291, 305, 411, 416, 441, 443, 453, 522, 523, 536, 542, 
547, 566, 588, 592, 595, 603, 621, 628, 630, 632, 644, 645, 
648, 655, 660, 672, 674, 676, 677, 683 
<223> n = A,T,C or G 

<400> 62 

gctggagagg ggtacggact ttcttggagt tgtcccaggt tggaatgaga ctgaactcaa 60 

gaagagaccc taagagactg gggaatggtt cctgccttca ggaaagtgaa agacgcttag 120 

gctgtcaaca cttaaaggaa gtccccttga agcccagagt ggacagacta gacccattga 180 

tggggccact ggccatggtc cgtggacaag acattccngt gggccatggc acaccggggg 240 

ggatcaaaat gtgtacttgt ggggtctcgc cccttgccaa aaccaaacca ntcccactcc 300 

tgtcnttgga ctttcttccc attccctcct ccccaaatgc acttcccctc ctccctctgc 360 

ccctcctgtg tttttggaat tctgtttccc tcaaaattgt taatttttta nttttngacc 420 

atgaacttat gtttggggtc nangttcccc ttnccaatgc atactaatat attaatggtt 480 
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atttattttt gaaatatttt ttaatqaaci- i-nrr^=.^ 

cnttttnttt ggggggggtg gggggntggj SSaattS J^^^^^^^^^^ tccttncttc 540 

ttnttacttg gggcccccct naaaaaan?? aSfc^^JJ tttggaancc cnatnggaaa 600 

ctaaaaaaaa ananannaaa aan anttccaatt cttnnatngc ccctnttccn 660 



<210> 63 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



f r,»^;':^;.%^!f'j!f-,i'f'jf .,3" 3.. 33,, 3.2. 3«, 3,0, 

502, 512, 531, 546, 554 563 til' ill' t^' 498, 

SLT: t^l:^^r- 

<221> misc^feature 

""M?';!?:',!??',!!" 717, n., 

<223> n « A,T,C or G 
<400> 63 

sss; ssss ?M? ~2 ss"'"" " 

acggatccgc tcgcgccacg gcctcatcca tS^-^! f ''^'^cgcgcgg tggcgagttc 120 
ccgcgagctc accgcatgcc cttcttggal gSgSaSc tT'Tj"" ^^g^-^aga 180 
gaaggcgtng ggggcccgca aantacScg ?Sacan ^"^"^^ cgcccanaaa 240 
taatattggt tnaaaanctg canaan^S cctSn^n tatggaangt cctcttgcaa 300 
cncttacctn gtttggntgc ggttac^Sg aacSttn lll'"^"^^^. S^tgcagggc 360 
ttccgggaaa attntncaaa tttttnttg? aaaat?^^^ ggaaaaccct nocnaaaacc 420 
gaaacntttt tgccctnnaa antaaacca? taaa^Jf ^ ggtaaacccc ccnaaaatgg 480 
tttttntttt tttntgcccc can^nncccc llaall^^ gggccccccc ncaaaaccct 540 
cccccctncc nanantttta aaagg^ggg SaSt^tn "=""""59 ggaaaanccc 600 
ggngntaaaa nggtttcnco occccga|gg ZaaS^^n """^^^^'^^ gggnoccccn 660 
ccncnttttn n ^^^^ gnggggnnnc ctcnnaaacc cntntcnnna 720 



<210> 64 

<211> 313 

<212> D^^A 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 240 

<223> n - A, T, C or G 

<400> 64 



gttagagatg gttgctacS S^ggS tSaS^Sf g^f-taact tgcaacgtct 60 
taaagttgat agagaatatg aa?aa?gca? g?caaaaaa^ ^^ragg agcagattgc 120 

gattagagat aagtatgaga agaaag?Sc ?cta!tt^»^ ctctcggaaa atattaaaga 180 
aaatgttgat catgtataL tatccItaS gaataaaat? ^^^^^^^ aatgaagatn 240 
aaaaaaaaaa aaa . ^^^^aaaatt gtctcagtaa agttgtaaaa 300 

313 

<210> 65 



<211> 420 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<222> 400, 402, 403, 404, 405, 406, 409, 411, 412, 414, 415, 416 
<223> n = A,T,C or G 

<400> 65 

actagttccc tggcaggcaa 
caggaagctg gcagtggcag 
tctgggaggt tggagggaag 
gtagatactg ccttaacact 
ctccgtgctc actaatttat 
atttgtttta acattttcat 
acacaaatta atgatattaa 



<210> 66 
<211> 676 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 328, 454, 505, 555, 586, 612, 636, 641 
<223> n = A,T,C or G 

<400> 66 

actagtttcc tatgatcatt 
cctcaatttg tacttcatca 
aaataaactc acaaatctgg 
aacaaggaat aatcccacaa 
actgttttta aggatttgcg 
gtagttttta aatgtgagct 
gttagaaacc tgttctcttg 
actccagccc attgcaaagt 
ctttttaaga aaaaattgga 
tttttggtga aaaanaatca 
ccctggaaaa anttgttaat 
ttaaagggaa aactta 

<210> 67 
<211> 620 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 419, 493, 519, 568, 605, 610 
<223> n = A,T,C or G 

<400> 67 

caccattaaa gctgcttacc aagaacttcc ccagcatttt gacttccttg tttgatagct 60 

gaattgtgag caggtgatag aagagccttt ctagttgaac atacagataa tttgctgaat 120 

acattccatt taatgaaggg gttacatctg ttacgaagct actaagaagg agcaagagca 180 

taggggaaaa aaatctgatc agaacgcatc aaactcacat gtgccccctc tactacaaac 240 

agattgtagt gctgtggtgg tttattccgt tgtgcagaac ttgcaagctg agtcactaaa 300 



gggcttccaa ctgaggcagt gcatgtgtgg cagagagagg 60 
cttctgtgtc tagggagggg tgtggctccc tccttccctg 120 
aatctaggcc ttagcttgcc ctcctgccac ccttcccctt 180 
ccctcctctc tcagctgtgg ctgccaccca agccaggttt 240 
ttccaggaaa ggtgtgtgga agacatgagc cgtgtataat 300 
tgcaagtatt gaccatcatc cttggttgtg tatcgttgta 360 
aaagcatcca aacaaagccn annnnnaana nnannngaaa 420 



aaactcattc tcagggttaa 
ataagttttt gaagagtgca 
atgcatttct aaattctgca 
tatacctagc tacctaatac 
cttacttgtg gctgaggaaa 
tatagatngg aaacagaata 
ttatctgaat cttgattgca 
ctcagatatc ttanctgtgt 
gtttnaaaga aataaacccc 
tcccgcaggg cttattgttt 
taaatgggga aaatgntggg 



gaaaggaatg taaatttctg 60 
gatttttagt caggtcttaa 120 
aatgtttcct ggggtgactt 180 
atggagctgg ggctcaaccc 240 
aataagtagt tccgagggaa 300 
tcaacttaat tatggaaatt 360 
attactattg tactggatag 420 
agttgaattc cttggaaatt 480 
tttgttaaat gaagcttggc 540 
aaaaanggaa ttttaagcct 600 
naaaaattat ccgttagggt 660 

676 
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cccaaagaga ggaaattata ggttagttaa ar;.+.^^o=*. 
cacttttgaa gtgttttgtt ttttaSttt StS^g a^'S^f 
ctaaaaaaaa agggatatca atctctaatt caata™ f^^tactttg ggggaaaang 420 
tctttactgg aanttatggg acttttLag c^™^^^ ttTT^f ccctaaaaag 480 
ttgcatgggc cccttaaaat tgttgaangg SttccS^ J^^^^^^^^^ caaattaapc 540 
ccccnttttn aaaatttgga cattcctgcc tctaagtttg gggaaaattc 600 

620 

<210> 68 
<211> 551 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

"e. 53,. =3S. 

^^^^> n = A, T,C or G 
<400> 68 

sSai fsss itiiTciiit :i:ih:t 

ttagaacaga cctctctgtg caSSrtJa taaof f ° '^"agctgta agagtctggc 120 
gtattggggt tgcaatgac^ cccaaggS alllalllft ^^^^^^^^^gg gccggcattt 180 
tctgagactg tggtgaaact ccttSaaaa cta^^^^ ! aaggcacgac tgggatttct 240 
actcggcacc actt£gatat ^caacaaacc ^^^f^^^ggt cagtangtgc tctgggaggg 300 
tacagctgat ggaac?caat tt^aacStc aallltT^ -aattataaa attgtta??? 360 
ttaaacctaa ttacatttgt otTgolllTa atttaatlrl tattatattg 420 

cctatgtgct cccctccccc imatcttaa? ^J«f^ tgtngcatat gtttttttcn 480 
nannnannna a nnatcttaat ttaaaccnca attttgcnat tcnccnnnnn 540 

551 

<210> 69 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc_feature 
<222> 235, 310, 323, 381 
<223> n = A,T,C or G 

<400> 69 

Tc^iiT^iit iit^.TaT.i ^1^— «o 

gtatgtggga tattgaatgt taaajaa^^ ^^^^^^^^^ ggtaatcccc tcaaataaca 120 
aattaagcaa atgttaaalg tttlllllll lllltllTa JtS""^' aattgtacaa 180 
tgtgatacat tttttaagct tcaati-o^?^ f ? ttttcaaaag gtatnataca 240 

ggggagccan aaaccaaL^ acnScS^ tttJttaf ^''^^^^^^^ atgggctttt 300 
aaaaataaat aaaaactatt nagaaattj^ aaaSa ^^rgacatgca ataaaattta 360 

^ 396 
<210> 70 

<211> 536 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 
<222> 388, 44 6, 455 
<223> n = A,T,c or G 
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<400> 70 

actagtgcaa aagcaaatat aaacatcgaa aaggcgttcc tcacgttagc tgaagatatc 60 

cttcgaaaga cccctgtaaa agagcccaac agtgaaaatg tagatatcag cagtggagga 120 

ggcgtgacag gctggaagag ca'aatgctgc tgagcattct cctgttccat cagttgccat 180 

ccactacccc gttttctctt cttgctgcaa aataaaccac tctgtccatt tttaactcta 240 

aacagatatt tttgtttctc atcttaacta tccaagccac ctattttatt tgttctttca 300 

tctgtgactg cttgctgact ttatcataat tttcttcaaa caaaaaaatg tatagaaaaa 360 

tcatgtctgt gacttcattt ttaaatgnta cttgctcagc tcaactgcat ttcagttgtt 420 

ttatagtcca gttcttatca acattnaaac ctatngcaat catttcaaat ctattctgca 480 

aattgtataa gaataaaagt tagaatttaa caattaaaaa aaaaaaaaaa aaaaaa 536 

<210> 71 

<211> 865 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 22, 35, 39, 56, 131, 138, 146, 183, 194, 197, 238, 269, 277, 
282, 297, 316, 331, 336, 340, 341, 346, 349, 370, 376, 381, 
382, 392, 396, 397, 401, 433, 444, 445, 454, 455, 469, 472, 
477, 480, 482, 489, 497, 499, 511, 522, 526, 527 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 545, 553, 556, 567, 574, 580, 610, 613, 634, 638, 639, 663, 
672, 689, 693, 694, 701, 704, 713, 723, 729, 732, 743, 744, 
749, 761, 765, 767, 769, 772, 774, 780, 783, 788, 792, 803, 
810, 824, 840, 848 
<223> n « A,T,C or G 

<400> 71 

gacaaagcgt taggagaaga anagaggcag ggaanactnc ccaggcacga tggccncctt 60 

cccaccagca accagcgccc cccaccagcc cccaggcccg gacgacgaag actccatcct 120 

ggattaatct nacctctntc gcctgnccca ttcctacctc ggaggtggag gccggaaagg 180 

tcncaccaag aganaanctg ctgccaacac caaccgcccc agccctggcg ggcacganag 240 

gaaactggtg accaatctgc agaattctna gaggaanaag cnaggggccc cgcgctnaga 300 

cagagctgga tatgangcca gaccatggac nctacncccn ncaatncana cgggactgcg 360 

gaagatggan gacccncgac nngatcaggc cngctnncca nccccccacc cctatgaatt 420 

attcccgctg aangaatctc tgannggctt ccannaaagc gcctccccnc cnaacgnaan 480 

tncaacatng ggattanang ctgggaactg naaggggcaa ancctnnaat atccccagaa 540 

acaanctctc ccnaanaaac tggggcncct catnggtggn accaactatt aactaaaccg 600 

cacgccaagn aantataaaa ggggggcccc tccncggnng accccctttt gtcccttaat 660 

ganggttatc cnccttgcgt accatggtnc ccnnttctgt ntgnatgttt ccnctcccct 720 

ccncctatnt cnagccgaac tcnnatttnc ccgggggtgc natcnantng tncncctttn 780 

ttngttgncc cngccctttc cgncggaacn cgtttccccg ttantaacgg cacccggggn 840 

aagggtgntt ggccccctcc ctccc 865 

<210> 72 

<211> 560 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 83, 173, 183, 186, 209, 211, 215, 255, 321, 322, 323, 335, 
344, 357, 361, 368, 394, 412, 415, 442, 455, 469, 472, 475, 
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<223> n « A,T,C or G 
<400> 72 



aaaagacagt gtccagtgct ccngcctjgg aa^S cggcgacgtc tcttttgact 60 
ccatgcccaa cttctctggc aac?ggaaS JcatcS^ gaccgcctcc cgcgccgcca 120 
tcnaantgct gggggtgaat gtga^Ltna noainf^f ggaaaacttc gangaattgo 180 
cagcagtgga gatcnaacag gagggSca ScS^^^ tgtggctgca gcgtccaagc 240 
gcaccacaaa gattaacttc nn^S^Saa aaSnJf? caaaacctcc accaccgtgc 300 
ngcctgtnaa aacctggtga aatgggSS t^S! ggancaaact gtggatngga 360 
cctgaaagga gaaggccccc anaaScSg Sccnaa^f atggtctgtg ancanaaact 420 
actgatnctt gaaccctgaa cgggcgggat actgacccnc cnatngggga 480 

tttccntttc cccaaaaaaa ^anccttttt tnttgccncc naangggttc 540 



<210> 73 

<211> 379 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



inf^f2/L";J:'l2l: i1; 81, 102, 104, 

214, 215, 219, 220 235 237' lA' lit' 189, 190, 

319, 322, 343, 353, 354' ' ^02, 310, 313, 



<223> n - A,T,C or'c 
<400> 73 



aX'c^:^^^^^ n'a'a^^^^^^^^^^^^ T^l^^l^T^ aT^^^^^^^ ^^^^^-^ — ctga 60 
gnanngagga acanaacaaa SSangJ^? tngngctttc nngnacaaS 120 

ttggccacnn gtggaattaa gaaatc^ggc aaam^^^n f ^^^^^^^^ gaaggggccc 180 
ataagngacc ctttatttca tctgtat??a aacST ^^^^^^^ gcctnangag 240 
tnccacgtan agntggaant antgttgtc tJ^aaS^!^ ttccctgnca taacttcttt 300 
ttgttcaaaa aaaaaataa ^tggactgtt gtncatttta gannaaactt 360 



<210> 74 

<211> 437 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 145, 355 
<223> n = A,r,C or G 

<400> 74 



lTdT.Z llimitT. c^s^ta^s: -^T-- - 

-:a1-S SXfg :--.c ™- -aaa^^^^^^ j.j 

aatcactgaa ttgtcagg^ tS^ttaat^ 2^^^^^" atttggcttt cfcttcJtct 24S 
gaataagtta taatcagLt tcttlttltt ollfttlT ^^^^^^cct tctgttgtgg 300 
gtcatttgta ctgtttgaaa aatatttct^ Sa2£2t ^'^^^""^'^ ctctna^tg 360 
aaaaaaaaaa aaaaaaa ctataaaatt aaactaacct gccttaaaaa 420 

<210> 75 
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. <211> 579 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 440, 513, 539, 551 
<223> n « A,T,C or G 



<400> 75 

ctccgtcgcc gccaagatga tgtgcggggc gccctccgcc acgcagccgg ccaccgccga 60 

gacccagcac atcgccgacc aggtgaggtc ccagcttgaa gagaaagaaa acaagaagtt 120 

ccctgtgttt aaggcqgtgt cattcaagag ccaggtggtc gcggggacaa actacttcat 180 

caaggtgcac gtcggcgacg aggacttcgt acacctgcga gtgttccaat ctctccctca 240 

tgaaaacaag cccttgacct tatctaacta ccagaccaac aaagccaagc atgatgagct 300 

gacctatttc tgatcctgac tttggacaag gcccttcagc cagaagactg acaaagtcat 360 

cctccgtcta ccagagcgtg cacttgtgat cctaaaataa gcttcatctc cgggctgtgc 420 

ccttggggtg gaaggggcan gatctgcact gcttttgcat ttctcttcct aaatttcatt 4 80 

gtgttgattc tttccttcca ataggtgatc ttnattactt tcagaatatt ttccaaatna 540 

gatatatttt naaaatcctt aaaaaaaaaa aaaaaaaaa 579 

<210> 76 

<211> 666 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 411, 470, 476, 491, 506, 527, 560, 570, 632, 636, 643, 650, 
654, 658 

<223> n = A,T,C or G 



<400> 7 6 

gtttatccta tctctccaac cagattgtca gctccttgag ggcaagagcc acagtatatt 60 
tccctgtttc ttccacagtg cctaataata ctgtggaact aggttttaat aattttttaa 120 
ttgatgttgt tatgggcagg atggcaacca gaccattgtc tcagagcagg tgctggctct 180 
ttcctggcta ctccatgttg gctagcctct ggtaacctct tacttattat cttcaggaca "240 
ctcactacag ggaccaggga tgatgcaaca tccttgtctt tttatgacag gatgtttgct 300 
cagcttctcc aacaataaaa agcacgtggt aaaacacttg cggatattct ggactgtttt 360 
taaaaaatat acagtttacc gaaaatcata ttatcttaca atgaaaagga ntttatagat 420 
cagccagtga acaacctttt cccaccatac aaaaattcct tttcccgaan gaaaanggct 480 
ttctcaataa ncctcacttt cttaanatct tacaagatag ccccganatc ttatcgaaac 540 
tcattttagg caaatatgan ttttattgtn cgttacttgt ttcaaaattt ggtattgtga 600 
atatcaatta ccacccccat ctcccatgaa anaaanggga aanggtgaan ttcntaancg 660 
cttaaa 666 

<210> 77 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 31, 54, 125, 128, 136, 163, 168, 198 
<223> n = A,T,C or G 



<400> 77 

ctgcagcccg ggggatccac taatctacca nggttatttg gcagctaatt ctanatttgg 60 
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atcattgccc aaagttgcac ttgctggtct ct^rT,T^=.^4-4. 

catanganta tgccanaata aattccattt ?tt?^™^ gg^cttggaa aggtatcata 120 
tggtccacag cataacangc actgcctcct taSataa^ ^^''f ^^^^ gggctggttt 180 
attaagtgag aagggagact ctcagccttc agc2£Sf lltltTt^ taaagcatgg 240 
gaagtttttt aaacctctga atttgtacac a?ttaaaaS tllT^f'' tgtgactttc 300 
aatacttcta atgggaacaa aaaaaaaaaa aaaaaa ^"^^^^^^^ac tttaaaataa 360 

396 

<210> 78 

<211> 793 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

™f%«f'4':',lf' 732, ,40, „e, 

<223> n = A,T,C or G 

<400> 78 

SaSS SJSS? f,?gS?^ aaatoca,a. ttgccat.,a SO 

taccacagtc aaacctggag ccaaSagga ^SSc j!^'^^'^^^*^^ tggccagaga 120 
gaccctctcc agaggttggg gtgaccaS? SStaaart '^^^f^'^^ aactgcccca 180 
atataaatcc aagacaagca acIaacccS gSJ^^J^aS TT'^lt''^ aagaagctct 240 
aoacagtcna gctttaaaga aagtgtttgc ^gafa^J^^J catcacttgg atgagtgccc 300 
gcagtttgtc ctcctcaatc tggtSatga aacaa^f ^^^^t'^'^aga aattggcaga 360 
ccagtatgtc ccaggattat g?ttgttg?c ccToSTal TJt'^^^'' ctcctgatgg 420 
ggaagatatt cnaaccgtct ctatgcttac aaactio^^ cagttgaagc cgatatcctg 480 
atgaaaaagc tctcaagttg ctnaaaatoa attSf^ tacgctctgt tgcttgacac 540 
tctgtcggct tgaaaaSgl aac^^gSa Sg^gSaaa ttaTl^^^ -gccttctg 600 
gacacctgat taggttttgg ttatg£tcac caSftt^^f ^^^ctattgt ggaacanatn 660 
ttggttcaat tntctttttn aaacSSg ???cScntt nttttaaaat 720 

aataatnttt ggc ^ tttctacntt gnganctgat ttctaaaaaa 780 

793 

<210> 79 

<211> 456 

<212> DNA 

<213> Hcano sapiens 

<220> 

<221> misc^feature 

tilt r; If?;c1^/"- 353, 3»4, .23, „5, .36, 4„ 

<400> 79 

^Mg^ a~- .ctgttcjtg ctccaaaggg 60 

gcagctgttg agcgcaccta accaotggt? tlaol^^Jr. cactcttctt 120 

tcctcccgac cccangacca ggctac??^ ccccSrtc? tatT^T ^^^^^^^^ct 180 
tgcctctgat cgtangaatt gangantgtc ccaS??^^ tgcctccctc ctgcccctgc 240 
ggggctggaa atgggtgtgt ^gjgtgt^ g^ataSa^ ^^^^^tg gacagtggca 300 
tgcaagaccg agattgaggg SLIa^gS ?gSggS2 alt^t^.f gcnccccccc 360 
aantncccct gtgacnctca naaaaaaaaa aaaaJI Saccatgttt cctctccata 420 

<210> 80 

<211> 284 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 
<222> 283 

<223> n « A,T,C or G 



<400> 80 

ctttgtacct ctagaaaaga taggtattgt 
taaaactaaa agtaatgctc actttagcaa 
gaatagcatg acctccgtgc aaacaggaca 
aataaataaa tgtgtatatg tgtaacttgt 
aaatgtattt cttactgtga aaaaaaaaaa 

<210> 81 

<211> 671 

<212> DNA 

<213> Homo sapiens 



gtcatgaaac ttgagtttaa attttatata 60 
cacatactaa aattggaacc atactgagaa 120 
agcaaatttg tgatgtgttg attaaaaaga 180 
atgtttatgt ggaatacaga ttgggaaata 240 
aaaaaaaaaa aana 284 



<220> 

<221> misc^feature 

<222> 388, 505, 600, 603, 615, 642, 644, 660 
<223> n = A,T,C or G 



<400> 81 

gccaccaaca ttccaagcta ccctgggtac 
agcaagcggt gtgcacacgg agactcatcg 
gaaaggctgg ggatatttgg gttggcttgg 
tactaaaaca gtattatctt ttgaatatcg 
tcaagatggc tagaatggtg cctttctgag 
ttcaacacac ttccactgcc tgcgtaatga 
tttcaatgcc gtcattttca gttagatnat 
atttgattag tcttattttt ttatttttac 
tgtgacaaag tcaaataaac ccccnaggac 
acattaagct ttggccaaaa aatgttgcat 
canaaaggct ggctnataat gttggtggtg 
aaaaaaaaaa a 



ctttgtgcag tagaagctag tgagcatgtg 60 
ttataattta ctatctgcca agagtagaaa 120 
ttttgatttt ttgcttgttt gtttgttttg 180 
tagggacata agtatataca tgttatccaa 240 
tgtctaaaac ttgacacccc tggtaaatct 300 
agttttgatt catttttaac cactggaatt 360 
tttgcacttt gagattaaaa tgccatgtct 420 
aggcttatca gtctcactgt tggctgtcat 480 
aacacacagt atgggatcac atattgtttg 540 
gtgttttacc tcgacttgct aaatcaatan 600 
aaataattaa tnantaacca aaaaaaaaan 660 

671 



<210> 82 

<211> 217 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 35 

<223> n = A,T,C or G 



<400> 82 

ctgcagatgt ttcttgaatg ctttgtcaaa 
agacaataag tggtggtgta tcttgtttct 
tcttattagg gagttgtatg tcagtgtata 
aaattcttta aaaggaaaaa aaaaaaaaaa 

<210> 83 

<211> 460 

<212> DNA 

<213> Homo sapiens 



ttaanaaagt taaagtgcaa taatgtttga 60 
aataagataa acttttttgt ctttgcttta 120 
aaacatactg tgtggtataa caggcttaat 180 
aaaaaaa 217 
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<220> 

<221> misc_feature 

11% ' 

<400> 83 



2tS2S? LTcc^^S ^T--" ^0 

aacggagacg caggagaaga acaccctgcc gaccaSaa f^^^^f^^^^ agctgaanaa 120 
gagtgaaatt tcctaagatc ctggaggitt Sc?ac™^ angagaagcg 180 

cgtgatgtgg aggaagagcc acctgcaaaa taa^^ff ^^'^^'^"cg agaccccagt 240 
ctgggcactc cgcgccgatg ccacJggcct Il^Xl "^^^^^^^^^^^ cactgtgaac 300 
gactgccaaa ttctccggtt tgccccggS S??X^f Sraagggaccc cccccaatcg 360 
annataaaac acacctcgtg gincaSna aaaaaaaaaa """^"^^ atgaataatg 420 

460 

<210> 84 



<211> 323 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 



<2^>r:^?:c'^^/"' 3„ 

<400> 84 

SS^Sal llltltlf^t T^ltllToll Itl^lllZ -tggaaget SO 

l^Xlll :™ &s ™- -2 

cnancatctc tctagctgac cgatcaS^c TalT^f^ catgaacntg ctgtctcaat 240 
atttcctgta naaaaaaaaa gtcccagatt actacanatc ataataattg 300 

323 

<210> 85 
<2H> 771 
<212> mA 
<213> Homo sapiens 



<220> 

<221> inisc__feature 



iL'?ii!vl«;>" ,12: ,11: 



<223> n = A,T,C or G 
<400> 85 



.*a»1???S ~cJ ..... . 

gaagcaagca actttgactg ctatcttaa! f ^'=^5°^^° tccacctctg cggcgaatca 120 
attgtgggct tcacaigg J gcSgccaS a.^TJT ^^^"^^-^^ tcctaaattt lao 
cacacaaaga aaaagt?gtc ^g^gtgSca TalllttTj -«-\caatgc tatcatcttt 240 
gtgcgtctcc tcagtaaaaa agt?aaSac a^^^f agacttgggt gaaatatatt 300 
attggacata gcccaagaac aSaaSan^ atgtaaaaac tgtggctttt ctggaatgga 360 
atgganggtt tagtgc?tat TtltltT^ ScS' ^gaggtttca cttgcacf?c 420 
atcatattgc atcatanttt gctttgtS! ttgtccaatt natgaagtta 480 

gttatttata gctntaggtt ??ctg?g^Jt aarS^f^ naaattaaac tgtattttat 540 
ttggtntant gcaanttaaa aatt?tKS ^^''""tat acnaantttc ctaaactatt 600 
gocacaagct ttttttaaaa aaccanta^ Wggggggaa taaatattgg antttctgca 660 
tttgcttttn antagaaaat t?„^\%^%- -aatgltt 72^ 



wo 02/00174 



35 



PCTAJSOl/21065 



<2io> 86 

<211> 628 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 162, 249, 266, 348, 407, 427, 488, 518, 545, 566, 569, 597, 
598, 611, 617, 621, 624 
<223> n = A,T,C or G 



<400> 86 

actagtttgc tttacatttt tgaaaagtat 
cttgtgttag gtaagaatgg aatttattaa 
attatcttaa agctgaagcc aaaatatgct 
agttcataca ttcaaagcat ctgaactgta. 
gtggagaang aaatagatta atgtcnaagt 
aatctggggt tgaaattttc tagttttcat 
gaaatattaa tgtttacctt tcaatgtgtg 
ttccctnggg gatggggaat ggattattgg 
tcctttcnca gtttctggct cctaccctac 
catcntctgc tttattccca ttaatnaant 
ccaaggaatt nagtggnttc ntcnttgt 

<210> 87 

<211> 518 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 384, 421, 486 
<223> n = A, T, C or G 



tatttttgtc caagtgctta tcaactaaac 60 
gtgaatcagt gtgacccttc ttgtcataag 120 
tcaaaagaaa angactttat tgttcattgt 180 
gtttctatag caagccaatt acatccataa 240 
atgattggtg gagggagcaa ggttgaagat 300 
tctgtacatt tttagttnga catcagattt 360 
gtatcagctg gactcantaa cacccctttc 420 
aaaatggaaa gaaaaaagta cttaaagcct 480 
tgatttancc agaataagaa aacattttat 540 
tttgatgaat aaatctgctt ttatgcnnac 600 

628 



<400> 87 

ttttttattt tttttagaga gtagttcagc 

tataacaaca ttatactgtt tatggtttaa 

agtagtacag ttttaaaatt ttatgcttaa 

ttttacatgg caaatcaatt tttaagtcat 

aaacacattt aatttcaatt tctctcttat 

ctacagttta acaatgcagc aaaattccca 

ggttaaaatg ctttgaggat cctnaatacc 

naatttaacc ctcatgccat aagcagaagc 

taaaancgag ccccccgttg aaaaagcaaa 

<210> 88 

<211> 1844 

<212> DNA 

<213> Homo sapiens 



ttttatttat aaatttattg cctgttttat 60 
tacatatggt tcaaaatgta taatacatca 120 
aacaagtttt gtgtaaaaaa tgcagataca 180 
cctaaaaatt gatttttttt tgaaatttaa 240 
ataaccttta ttactatagc atggtttcca 300 
tttcacggta aattgggttt taagcggcaa 360 
ctttgaactt caaatgaagg ttatggttgt 420 
acaagtttag ctgcattttg ctctaaactg 480 
agggaccc 518 



<400> 88 

gagacagtga atcctagtat caaaggattt 
tattttattt tatttttcga gactccgtct 
ggtatttgct aaagcatttt gagctgcttg 
ttccatcttc ttggtgctgg gaagccatat 
agcttatgtg ttgaatttgc tacatctata 
gaaatagaaa tatcatagaa catttaagaa 



ttggcctcag aaaaagttgt tgattatttt 60 

caaaaaaaaa aaaaaaaaaa agaatcacaa 120 

gaaaaaggga agtagttgca gtagagtttc 180 

atgtgtcttt tactcaagct aaggggtata 240 

tttcacatat tctcacaata agagaatttt 300 

agtttagtat aaataatatt ttgtgtgttt 360 
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taatcccttt gaagggatct atccaaagaa aai-;,n-4-4- 

tctcagtaac agatcctgtg ttagtct?tg aSaL^^f^ ttcctacacg 420 

tagatgtagc atacatatga tgtataatg? SgJgSJt atL^'^f" tgtcagtgag 480 
ttgtaggaat acaaaacatg gcctttttL Saeaa^^^ f *^^<=aatg tctgcagatt 540 
acatagggca atctgtgaat atgtattata aS^ff ^ggccaatga ctagaataac 600 
taattttcaa gtcaaaaagg gatatgaaaa a^«!^^ ! cagaaaagta gttggtgaaa 660 
ccttgetttt aaattaaalg Stacag?ca? ^tStalltl^ agtaacctct attUt?aag 720 
taataatgtt aggttagcaa aggtt?agS TtlStTtt ^^^^aa agcttgagag 780 
aatgcagctc ttcgagtcat ttctggtL; ?caaaaSJ^ ^^^^^^^cta ccatgatag? 840 
gcaccctacc tcacctgctt actgaca??a ^nff^ f cacccttttg cccatagaaa 900 
T^l'tt^- -ttactgtat atalLtac^ gaSSta?: ItitTftl'^ ^"^tclgcc ITo 
accatattca aaacctaaat ttgtttttgc loSaa^^^ ttttcctttc ttcgtttttc 1020 
gctttcacct agaagggtgt ggtcctgaag gSaSn f^^'^^^ aagtgttcgt 1080 
ggtgctcctc cttccctggt accctgLta l^n^^f cctaaatatc ccccaccctg 1140 
agtgcagcag cctgtgc??c cacagatggg ggtgcS r^''^^"^ -gctggagi 1200 
ccatcttagg gggagaagct agatcctata til caacaaggct ttcaatgtgc 1260 

attgctcttc ctgctgctV c?tt?St? ZlTol^' S'Tr^' aggagg?tlc 1I20 
Staf 1? ^°"gtgcct ctgcttatgc atgagS SSf gtctagagca 1380 
atttgaagtt caaaggtgta ttcaggatcc tcllttfzlt ff*=^^^°aac cataaccttc 1440. 
coaatttacc gtgaaatggg aattSgc^g StStS! "^f'^'^^tgc cgcttaaaac 1500 
a^Sf'T .^^"-tataa gagagaStt JaaSaS? atTtltlf"^ aaccatgcta 1560 
aaaatcaatc tttaggatga cttaaaaatt gattSrn^^ Sftgtttttaa atttcaaaaa 1620 
ttacacaaaa cttgttttaa gcataaaatt ??aaa!S f f ^f^^ta tctgcatttt 1680 
llllftT^ tatgtattaa accataaaca gtaSSJ ^^ff ""r" gtattataca 1740 
aaatttataa ataaaagctg aaaaaaaaaa SaaaaaSa aaaa ^ aacaggcaat 1800 

<210> 89 
<211> 523 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> mlsc feature 



<123: f !'?^^A 'JJl 338, 475, 511, 513 



<223> n = A,T,C or 
<4 00> 89 



tttttttttt tttttttagt caatccaca*- u 

gggataaaga tgactgttag tcSctcacJj Jaa"^f^ cttattatgt accaggcact 60 
acaatatgat gtagaaaatg ctJSoSag! JaSaf^ aactagcaaa taagacgatt 120 
tcaccttgtc tttccacatc ccta?cct?c acaaaS^ ^gtcctattg ggtcct?ctg 180 
ctccccactg cagatcccct gggattttgc ctaafan^ ^^^^^agctt cctgcccccg 240 
gccctggcat gacttgaacc oaaccaSaa ^^^^^^^^^^ acgagganat gggccccctg 300 
actttgatna gaaaacacat ^gg^^S 2Sn?I ^^^^<="t<=5 anagtggat? 36? 
ggtgctcaag aaaagtttgc agaataaata a!h^! f cccaaatggc caoccgtgct 420 
taattgaatg gtggctcaa't alga^^? nSS! "^^^^-tt aatanft^aa 480° 



taattgaatg gtggctcaa\^ aXafg^S SjgS a"? 

<210> 90 
<211> 604 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> ndsc_feature 
<222> 563 

<223> n = A,T,C or G 
<400> 90 
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ccagtgtggt 
gcaaaggaaa 
ctcacccacc 
gggagccttc 
ctctttcctc 
ctggaatgca 
cactgcattg 
aggggtggta 
accactaatg 
ccctaattgc 
cccc 



ggaatgcaaa 
tagccaatat 
aactaaatcc 
aagggcatgt 
tctgatcctt 
attgtttgca 
gtgataggac 
cacctgtttg 
gggagggcag 
tgaaattccc 



gattaccccg 
gtgtcgtttc 
caaagtcaaa 
agaaaatcag 
ttcctcttta 
acaccgaagg 
acggtaattt 
gtaaaatgag 
attattactg 
ctnggcaggc 



gaagctttcg 

tatgaaatga 
agcttcagcc 
ctgttcagat 
cggcacaaca 
atttcctgcg 
gattcacatt 
aagcctcgga 
ggatttctcc 
tccagttttc 



agaagctggg 
agccagaccg 
agtttatctc 
aggcctctgc 
ttcatgtttg 
gtcgcctctt 
taacttgcta 
aacttgggag 
tggggtgaat 
tcaactgcat 



attccctgca 60 
agatgtcaat 120 
agagaaccag 180 
accacacagc 240 
acagaacatg 300 
cagtaggaag 360 
gttagtgata 420 
cttctctcct 480 
taatttcaag 540 
tgcaaaattc 600 
604 



<210> 91 
<211> 858 
<212> DMA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<222> 570, 591, 655, 664, 667, 683, 
792, 794, 801, 804, 809, 817, 820 
<223> n = A,T,C or G 



711, 759, 760, 765, 777, 787, 



<400> 91 

tttttttttt ttttttttta 



tggcagagtt 
catttcctta 
ataaatgtct 
ttaaataagc 
agaatctcct 
atcccccggg 
gcccggtacc 
tacaacgtcg 
cccctttcgc 
cgcagcctga 
tggnggntcc 
ttcccttcct 
cctttanggg 
ggaaggtccc 



tctgatgctt 
ttcaagccat 
aaaacagcac 
ttggctaaaa 
gtataatctc 
ctgcaggaat 
caattcgccc 
tgactgggaa 
cagctggcgt 
atggcgaatg 
cccacgtgac 
ttctcgcacc 
tncnaattaa 
cgaagggg 



tgattattat 
aataaacatt 
gcttttctgt 
ctcgattctc 
tgggacatga 
accaggagat 
tcgatatcaa 
tatagtgagt 
aaccctggcg 
aatagcgaan 
ggacgcgccc 
cgntacactt 
gttcgccggg 
nggnttacng 



tttttttatt 
tgttctgatc 
gatattctga 
gtctataaca 
gtggaggtag 
tcaacgaatt 
gcttatcgat 
cgtattacgc 
ttacccaact 
agcccgcacc 
tgtagcggcg 
ggcagcgcct 
tttccccgnn 
gaccttngan 



gatctttaca 
agataagtgg 
tcctagttga 
ggactaagtt 
tcacacttca 
ccaccacact 
accgtcgacc 
gcgctcactg 
taatcgcctt 
gatcgccctt 
cattaaagcg 
tacgccggtc 
agctnttaat 
cccaaaaact 



tcctcagtgt 60 
aaaaaattgt 120 
acatacagaa 180 
cactgtgatc 240 
gcgaagaaag 300 
ggactagtgg 360 
tcgagggggg 420 
gccgtcgttt 480 
gcagcacatc 540 
ncaacagttg 600 
cggcngggtg 660 
nttcgctttc 720 
cgggggnctc 780 
ttgattaggg 840 
858 



<210> 92 
<211> 585 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> 317, 319, 320, 321, 325, 327, 328, 330, 331, 332, 460, 462, 
483, 485, 487, 523, 538, 566, 584 
<223> n = A, T,C or G 

<400> 92 

gttgaatctc ctggtgagat tatacaggag attctctttc ttcgc.tgaag tgtgactacc 60 

tccactcatg tcccatttta gccaagctta tttaagatca cagtgaactt agtcctgtta 120 

tagacgagaa tcgaggtgct gttttagaca tttatttctg tatgttcaac taggatcaga 180 

atatcacaga aaagcatggc ttgaataagg aaatgacaat tttttccact tatctgatca 240 

gaacaaatgt ttattaagca tcagaaactc tgccaacact gaggatgtaa agatcaataa 300 

aaaaaataat aatcatnann naaanannan nngaagggcg gccgccaccg cggtggagct 360 
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c^^'tlcS tXla1t\^ ^^S^ ir''^^'^ ^^-^-^-^ 

gcntnangtg taaaagcctg ggggtgSS f ^^fS ! cncaacatac gagccgggaa 480 
tgcgctccac ttgcccgctt' llllll^^^^^ TXlX't tc^T^^^ ''^^"^'^^^ 
<210> 93 



<211> 567 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_feature 



213^2?': 27I: 276' 2'77' aM' 2??' '''' '''' ^71, 272 

285, 286, 287 295,' 303 Voi' 314' III' f 284 

-vr: rJ:c^^'."' III' 

<221> misc^feature 

iL^=F:i££-"""-"^""- 

<400> 93 

cggcagtgtt gctgtctgcg tgtccaccti- ^ 

agactgcggc tggggtg^g^ aJSggga Iccgggg??? ^1.'^^^'=' gggaggacca 60 
ttocotgtac ccacottccc cttgctt^t gtJtSfnf^ atcttggaac 120 

ccagtttcct tgtgtgatac actaatgS ?tgc?«tt? f 9 ccggccaagc 180 

attaaattgc tantgtttct ttgaannnnj t^ggaaatan anaaaaatca 240 

ccncggngga aaonccccct tttgtSt ^^^^f^ nnnnnnnggg ggggncgccc 300 
gttaanccnt gggccaaanc tng?ScSg ^gSaf't it^^^^^ cncncntggc 360 
ccccccnncc ttccaaaccc ggaaanoctn annn?^? gttnatcccc tcccaaattc 420 
ngnaattnaa ccnaaccccc ?tt?Sa?ng nnStSncn ^^^=^9^999 gttgcctaan 48? 
nttcggggaa aaccctntcc gtgccca ccacnngccc cnctttccca 540 

<210> 94 ^^"^ 



<211> 620 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc_feature 
<400> 94 



ITSilZ r-r^^" « 

ss?- Hlf" sss: iis 

gttcttgtta tttccaaata gfSggartJ S^Jf ^ anatttaata atctga?caa 24S 
ataaggttaa aagttgttaa ?gaSaa" To^f^l fff^^^^ga gaagfggaag \ol 

ScStt: "'-"^^^^^^^ S2t-g tnactccga^' Je'S 

agggttaa^; ^tSgga ^^"^^^^^ aaattttcct ctttttaaaa attctttana 600 
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<210> 95 

<211> 470 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<222> 61, 67, 79, 89, 106, 213, 271, 281, 330, 354, 387, 432, 448 
<223> n = A,T,C or G 

<400> 95 

ctcgaccttc tctgcacagc ggatgaaccc tgagcagctg aagaccagaa aagccactat 60 
nactttntgc ttaattcang agcttacang attcttcaaa gagtgngtcc agcatccttt 120 
gaaacatgag ttcttaccag cagaagcaga cctttacccc accacctcag cttcaacagc 180 
agcaggtgaa acaacccatc cagcctccac ctnaggaaat atttgttccc acaaccaagg 240 
agccatgcca ctcaaaggtt ccacaacctg naaacacaaa nattccagag ccaggctgta 300 
ccaaggtccc tgagccaggg ctgtaccaan gtccctgagc caggttgtac caangtccct 360 
gagccaggat gtaccaaggt ccctgancca ggttgtccaa ggtccctgag ccaggctaca 420 
ccaagggcct gngccaggca gcatcaangt ccctgaccaa ggcttatcaa 470 

<210> 96 

<211> 660 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 299, 311, 360, 426, 538, 540, 542, 553, 563, 565, 592, 603, 
604, 618, 633, 647, 649, 651, 653 
<223> n = A,T,C or G 

<400> 96 

tttttttttt tttttttttt 
gcatttcttt tcattcgaat 
tgaagacttt ctgcttaatt 
gctttatagt acgtattttt 
tgtactgatt acaaggtcta 
cagcatctgg nggttggctt 
cttctgctga gctgggcctg 
gcctgncaca ggaactttgg 
aaacttgatg aagccttggt 
ancctgggct canggacctt 
gcnnagggac ccttgggncc 



ggaattaaaa gcaatttaat gagggcagag caggaaacat 60 
cttcagatga accctgagca gccgaagacc agaaaagcca 120 
caggggctta caggattctt cagagtgtgt gtgaacaaaa 180 
aggatacaaa taagagagag actatggctt ggggtgagaa 240 
cagacaatta agacacagaa acagatggga agagggtgnc 300 
ctcaagggct tgtctgtgca ccaaattact tctgcttggn 360 
gagtgaccgt tgaaggacat ggctctggta cctttgtgta 420 
tgtatccttg ctcaggaact ttgatggcac ctggctcagg 480 
caagggacct tgatgcttgc tggctcaggg accttggngn 540 
tgncncaacc ttggcttcaa gggacccttg gnacatcctg 600 
aaccctgggc ttnagggacc ctttggntnc nanccttggc 660 



<210> 97 

<211> 441 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 12, 308 

<223> n = A,T,C or G 

<400> 97 

gggaccatac anagtattcc tctcttcaca ccaggaccag ccactgttgc agcatgagtt 60 
cccagcagca gaagcagccc tgcatcccac cccctcagct tcagcagcag caggtgaaac 120 
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agccttgcca gcctccacct caqqaacrai- 

ccaaggtgcc tgagccctgc cac^ccaJIg t^clalTcl ^^^^^^^^^ ccctgcoacc 180 
agccatgcca ccccaaggtg cotgagccct at^tli^^ ? ctgccagccc aaggttccag 240 
agcagaanac caagcagSg taa?Sgg?c tl^ll^t gcaccagcc? 300 

agatgctgaa tcccctatcc cattSrtrt atS^^f^^ cccttgagga gccggccacc 360 
ctgtctcccc caaaaaaaaa 'a ^ ^tgagtccca tttgccttgc aattagcatt 420 

<210> 98 



<2H> 600 
<212> DMA 
<213> Homo sapiens 

<220> 

<221> inisc_feature 

<222> 295, 349, 489, 496, 583 

<^23> n = A,T,C or G 

<400> 98 



gcagccctgc atcccacccc ??cagc??Sa ocSS^^^" ^^^^^ttccc agcagcagaa 60 
tccacctcag gaaccatgca tcccSaaac SSJS f ^^"^^^^^ cttgccagcc 120 
gccctgccac cccaaagtgc ctgagccoJg ccaJcccL^ tgccacccca aggtgcctga 180 
caaggtgcct gagccctgcc ctLaatag? caSccaacf ^"^=^^^^9^ catgccaccc 240 
gcagaagtaa tgtggtocao agccatgccc ttSn^!^ ccagcccagc agaanaccaa 300 
cctatcccat tctgtgtatg agtccSJtt aclttttJ f^ccacoana tgctgaatcc 360 
aaaagaatgt gctatgaagc tttctttcc^ aoTaJ^T ^^^^^^^^ tctcccccaa 420 
ggtcttaant acaganctag ttttcagctg ctcTalltlt f^**'*''*^^^ tgaagctgaa 480 
tgaaaggeaa atgattcagc tccttat\a? SS^? ™ga 540 



<210> 99 

<211> 667 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 345, 562, 635 
<223> n « A,T,C or G 

<400> 99 



a'cca^'Sal lllllT^l ItT.ltttll S?^^^^'^ ^^^"^^^^ ^^-^gtttt 60 
ggtcctgacg ttttgagatc cllltln ^^^^^^^^^^t tcaggacaaa gcattcqaac 120 
tttctcttgt gaga?t?cco tcatc'tg'fa: SSal^ ^ttgtcatgg Jgaact^ag ^80 
agtagaagat ttgttgaaga catagafccc ttllHtJ f^"^^^^^^ tacaagcata 240 
ttaaagtctt gtgagcacct gggaltta^ a^a^^ff ttattaacct ttataaacat 300 
attttgtaag gctataattg ta^cttttS ^^f^^ tgttnatatt tttgatttac 360 
tggagatttt taagagttt? aaccagcjg: ?gSaSat: -ttggatttc tatgttgaaa 420 
g\X-- .™ ? iigSc J ~ ~S 480 
SJ-T" -c2.i: ---.a g.....,,, ,00 

<210> 100 
<211> 583 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> niisc_feature 

<222> 404, 506, 514, 527, 528, 538, 548, 556, 568, 569 
<223> n = A,T,C or G 

<400> 100 

gttttgtttg taagatgatc acagtcatgt tacactgatc taaaggacat atatataacc 60 
ctttaaaaaa aaaatcactg cctcattctt atttcaagat gaatttctat acagactaga 120 
tgtttttctg aagatcaatt agacattttg aaaatgattt aaagtgtttt ccttaatgtt 180 
ctctgaaaac aagtttcttt tgtagtttta accaaaaaag tgcccttttt gtcactggat 240 
tctcctagca ttcatgattt ttttttcata caatgaaatt aaaattgcta aaatcatgga 300 
ctggctttct ggttggattt caggtaagat gtgtttaagg ccagagcttt tctcagtatt 360 
tgattttttt ccccaatatt tgatttttta aaaatataca catnggtgct gcatttatat 420 
ctgctggttt aaaattctgt catatttcac ttctagcctt ttagttatgg caaatcatat 480 
tttactttta cttaaagcat ttggtnattt ggantatctg gttctannct aaaaaaanta 540 
attctatnaa ttgaantttt ggtactcnnc catatttgga tec 583 

<210> 101 

<211> 592 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 218, 497, 502, 533, 544, 546, 548, 550, 555 
<223> n « A, T,C or G 

<400> 101 

gtggagacgt acaaagagca gccgctcaag acacctggga agaaaaagaa aggcaagccc 60 
gggaaacgca aggagcagga aaagaaaaaa cggcgaactc gctctgcctg gttagactct 120 
ggagtgactg ggagtgggct agaaggggac cacctgtctg acacctccac aacgtcgctg 180 
gagctcgatt cacggaggca ttgaaatttt cagcaganac cttccaagga catattgcag 240 
gattctgtaa tagtgaacat atggaaagta ttagaaatat ttattgtctg taaatactgt 300 
aaatgcattg gaataaaact gtctccccca ttgctctatg aaactgcaca ttggtcattg 360 
tgaatatttt tttttttgcc aaggctaatc caattattat tatcacattt accataattt 420 
attttgtcca ttgatgtatt tattttgtaa atgtatcttg gtgctgctga atttctatat 4 80 
tttttgtaca taatgcnttt anatatacct atcaagtttg ttgataaatg acncaatgaa 540 
gtgncncnan ttggnggttg aatttaatga atgcctaatt ttattatccc aa 592 

<210> 102 
<211> 587 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_feature 

<222> 91, 131, 256, 263, 332, 392, 400, 403, 461, 496, 497, 499, 
510, 511, 518, 519, 539, 554, 560, 576 
<223> n =^ A,T,C or G 

<400> 102 

cgtcctaagc acttagacta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 

gcttatgttt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 120 

gggctgtgca ntccggtcag ggcgggaagg gaaatgcacc gctgcatgtg aacttacagc 180 

ccaggcggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 240 

cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 

ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
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gacactgccc attccctctc agggcaqctc ;=>n^4- 

ttcctttgaa aaagggcaaa ac^gaaaagg JcSttccS ^Qnctcttga acccagcctg 420 
ctttgccagg gcttcnntnt taccaaaacn ncttcS^^ naaaaagaaa aaccagggaa 480 
gcctccactt accnggggcn atgccccaaa aSaSt Scca^c^^'' tccccattng 540 
<210> 103 
<211> 496 
<212> DNA 

<213> Homo sapiens ' 
<220> 

<221> misc^feature 

II>"' "t- 

<Z23> n «. A,T,C or 6 
<400> 103 

anaggactgg ccctacntgc tctctctcot ^^4-= ^ ^ 

ctgcanccct tggncactgc anatg^Sc ctlttTatlt ^f^^^^^^^^ atggcagaac 60 
gcggtgggtc tccaccacaa ccac??tgac SStaScn ^^^^^^^^^^ ccctacccnt 120 
actggcagga tggaccttan ccnacatatc cctSoS^ "^^nanggtg gnttctcotg 180 
cccttaacat gatataatcc acccatgcaa nSgSaS "^^^^^ctnag anaaagaatt 240 
ttgcctaoag aatttcattc agtctacac? JSgcS ^=«^^9^tac catttaccat 300 
tgggctgacc gcaaaaggtg ccttacacar 11^^''^^^°^ ctctggcgat agagtgtggc 360 
gangcttgcc tcctccSc? ^l^tTol catgUoSt T'^'^"^ -^^-^c^lL 42? 
ggaaaagaaa caaaac catgttggat atcagggtgc tcnagggatt 480 

<210> 104 
<211> 575 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 18. 19 4s fio 1-7 

271, 273,' 30\l23,'V'3 3 '36f'37o''J"' '2^' ^38, 259, 263, 
440, 449, 450, 456 481 485' ill' 11^' ^82, 436, 

542, 552 ' 503, 510, 512, 515, 528, 

<223> n = A, T, C or G 

<400> 104 

gcacctgctc tcaatccnnc tctcaceai-,T 4. 

ctatggangt ggtttcnggg gtggcJctta n^f r^'^" tgcanaaact cctctgccaa 60 
ctgttcaact cngtttgig? ?g^ggg"c aaSnaaaf ^^f^'^^^ gtgtc?ctac 120 
tgttttggtg gaagggctgg taattqqcJt ta^^^T *^*99aa9cg gctnaactgt 180 
gaagttgcta ttgaaagtng c^gltlH ngn?ttS '^"^^"^^-S ttggcctngg 240 
ttgttnaatt tgggtgcttt gtnaa^gj^g gccc^Sc ^^""^ ctggtggcit 300 
ccnatgcngn aaacctcnac nnaaca|cc? ggacS^^ f ^^gcaatg aaaaaaatca 360 
cocccccaaa aaaggncaan cccctcaann llSltttf ''^^^*^9^aa aaagttgctc 420 
ncccnaaaac aaaaancccc ccntttccS ' ^ aaaaaatcct cgaatgggga 480 ' 

cnaaaaccct tntaaaaaac cccccgggaa Saa^^^^'^ ^aataconcc cccccf?t?a 540 

<210> 105 
<2H> 619 
<212> DNA 
<213> Homo sapiens 



<220> 
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<221> misc_feature 

<222> 260, 527, 560, 564, 566,* 585, 599 
<223> n A,T,C or G 



<400> 105 

cactagtagg atagaaacac tgtgtcccga gagtaaggag agaagctact attgattaga 60 
gcctaaccca ggttaactgc aagaagaggc gggatacttt cagctttcca tgtaactgta 120 
tgcataaagc caatgtagtc cagtttctaa gatcatgttc caagctaact gaatcccact 180 
tcaatacaca ctcatgaact cctgatggaa caataacagg cccaagcctg tggtatgatg 240 
tgcacacttg ctagactcan aaaaaatact actctcataa atgggtggga gtattttggt 300 
gacaacctac tttgcttggc tgagtgaagg aatgatattc atatattcat ttattccatg 360 
gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 
tttccaaatt tttgtacagt cgctgcacat atttgaaatc atatattaag acttccaaaa 480 
aatgaagtcc ctggtttttc atggcaactt gatcagtaaa ggattcncct ctgtttggta 540 
cttaaaacat ctactatatn gttnanatga aattcctttt ccccncctcc cgaaaaaana 600 
aagtggtggg gaaaaaaaa * 619 

<210> 106 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 21, 31, 32, 58, 75, 89, 96, 99, 103, 122, 126, 147, 150, 
158, 195, 210, 212, 219, 226, 246, 248, 249, 255, 258, 261, 
263, 265, 275, 304, 317, 321, 331, 337, 340, 358, 371, 377, 
380, 396, 450, 491 
<223> n « A,T,C or G 



<400> 106 

cattggtnct ttcatttgct ntggaagtgt 
gccttaaact ctgtnacact tttgggaant 
angtanagat gttctggata ccattanatn 
tatgtaaatg gtatntcatt cgctactatn 
gaatantnng cagcncanct nanangctgt 
acancattgt aacctcnatc nagtgagaca 
ttccaaatgg nctcatntcn aatgtttaaa 
atgttccacc aactagtacc tgtaatgacn 
gactgtggta ncccgcatcg gaaaaa 



nnatctctaa cagtggacaa agttcccngt 60 
gaaaanttng tantatgata ggttattctg 120 
tgcccccngt gtcagaggct catattgtgt 180 
antcaattng aaatanggtc tttgggttat 240 
ctgtngtatt cattgtggtc atagcacctc 300 
nactagnaan ttcctagtga tggctcanga 360 
agttanttaa gtgtaagaaa tacagactgg 420 
ggcctgtccc aacacatctc ccttttccat 480 

506 



<210> 107 
<211> 452 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 289, 317, 378 
<223> n = A,T,C or G 



<400> 107 

gttgagtctg tactaaacag taagatatct 
tcttttgaag catagataat attgtttggt 
tttaaagacc ctcctattct ataaaactct 
ctaaggttta caataggagt ggtgatttga 
gtggcataaa ttgcatcact gtatcatttt 
tggaaagtaa ctgtganaac ccagtttccc 



caatgaacca taaattcaac tttgtaaaaa 60 
aaatgtttct tttgtttggt aaatgtttct 120 
gcatgtagag gcttgtttac ctttctctct 180 
aaaatataaa attatgagat tggttttcct 240 
cttttttaac cggtaagant ttcagtttgt 300 
gtccatctcc cttagggact acccatagaa 360 
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-^^ctgetg gttgcttaaa 420 

452 

<210> 108 
<211> 502 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

tlfMl: foi'ltCll', 'II' '11' 231, 236, 259, 283, 235, 

488 ' 383, 409, 433, 446, 455, 466, 

<223> n = A,T,C or G 
<400> 108 

atcttcttcc cttaattagt tnttatttat ntattaaati- ^-^ . 
caaaaagaga ttgtagattg gcttctggcJ ccSc^aaaan "^"9^=^*9 tcctggcaaa 60 
agaccncaac tgaagcttaa aaaatctatc f f ^ ccataacaga aagtaccaca 120 

tanagcatat aaaactttta JcatntgSt lltTt^nt agaacattaa 180 

aaaatgtccc tttaacatnc aatatc?cac ItaSS^ f ntaatngaaa 240 

naaaaaaagg gtagaaggga tttaatgaaa actct^Jf ttnaggggat taccnngnaa 300 
ctccagaaca aaaacttn?c aanSt??ca oStltT ff^^tctgt ttanaaacgt 360 
aaactccatt agncccactt tctaang^c tcSnSt "J^"^^^"^ ggccactcia 420 
accctggnta ctcctgccct ca tctanagctt actaancctt ttgacccctt 480 



<210> 109 

<211> 1308 

<212> DMA 

<213> Homo sapiens 

<400> 109 



litlllllli ,~c1 g1a~ lllZT' ^—^^9 «0 

ggcatcttga ctgcaat?gg catggJcctc ctaaoa!... ^^^^'^"ctt ttcccctgtg 120 
ttggaggagg tgtttcactc tga^aaa^Jq llaltltntt ^^^gagccac cgcttcccag 180 
aaagaggtga ttgagaacac agaajcag?? caS^a^ ^aataaa ggctgaagaa 240 
ataagcaaac tcactaatga tLtJaactg aJcataSS tccaaaagtt tttgactgaa 300 
acatacctct tccttcaaaa atacttaaat t^^^^^ ^*=^9gctgtt tggagaaaaa 360 
gaacctgttg attttgtaaa tgcagcc?S SaaSSf "^^^"^^"^^ tgcatctctg 420 
gaaagcaaaa caaatgaaaa altcIagSc ?trt?cccaa T^Tl^^ ^tcctgggtt 480 
accaagctgg tgctggtgaa catggt??at ttS^^o^^ atggctctat tagtagotct 540 
aaagaaaata ctaaggaaga gaaattttaa ^^^^^f^^^^ aatgggacag ggagtttaag 600 
atgatgacac agagcLttc S?Sg"?c tlttt f^ 9-acaagtaa atctgtacag 660 
ctagggattc catataaaaa caacgacc^a atllln^l^^ aggacttgca ggccaaaat? 720 
gatggcctgg agaagataat agataaaSa Ja^ltTj ^^f^^^^^cc caacgacatc 780 
ccagggcata tggaagaaag aaaggtqaat atTr.tJt^ ^attggtaga gtggactagt 840 
agttacgatc tagaggcgg? cctggc^gcc SooaaaJf ««^9gtttga ggtggagglc 900 
aaagccgact actcgggaat gtcgcaggc tcltlTtat f ^^^^cctt cagtgagcac 960 
agttcctttg tggcagtaac tgaSaaggc accafaa^S ^^^-^f^^^g^^ gttcctgcac 1020 
tttactgtca catccgcccc aggSatSa aftnf ?f ? "^^gctgccac tggcataggc 1080 
ttcatcaggc acaatgaatc calcaJcSc ctcScttca T^^^f cttcctgt?c 1140 
gatgatcgtt gccatggcat tgctg?t?t? ajSaSial f *"tc ttctccttaa 1200 
tgattatgaa aatcgtccat tctt?taaaJ ggtggctc^c SgcatS^ S^tactcata 1260 

<2io> no 



<211> 391 
<212> PRT 
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<213> Homo sapiens 












<400> 110 














Met 


Asp Ser Leu 


Gly Ala 


Val Ser Thr Arg Leu Gly Phe 


Asp 


Leu 


Phe 


1 




5 




10 


15 




Lys 


Glu Leu Lys 


Lys 


Thr 


Asn Asp Gly Asn He Phe Phe 


Ser 


Pro 


Val 




20 






25 


30 






Gly 


lie Leu Thr 


Ala 


He 


Gly Met Val Leu Leu Gly Thr 


Arg Gly Ala 




35 






40 45 








Thr 


Ala Ser Gin 


Leu 


Glu 


Glu Val Phe His Ser Glu Lys 


Glu 


Thr 


Lys 




50 






55 60 






Ser 


Ser Arg lie 


Lys 


Ala 


Glu Glu Lys Glu Val He Glu 


Asn 


Thr 


Glu 


65 






70 


75 






'80 


Ala 


Val His Gin 


Gin 


Phe 


Gin Lys Phe Leu Thr Glu He 


Ser 


Lys 


Leu 






85 




90 




95 




Thr 


Asn Asp Tyr 


Glu 


Leu 


Asn He Thr Asn Arg Leu Phe 


Gly Glu Lys 




100 






105 


110 






Thr 


Tyr Leu Phe 


Leu 


Gin 


Lys Tyr Leu Asp Tyr Val Glu 


Lys 


Tyr Tyr 




115 






120 125 








His 


Ala Ser Leu 


Glu 


Pro 


Val Asp Phe Val Asn Ala Ala 


Asp Glu Ser 




130 






135 140 








Arg 


Lys Lys lie 


Asn 


Ser 


Trp Val Glu Ser Lys Thr Asn 


Glu 


Lys 


He 


145 






150 


155 




160 


Lys 


Asp Leu Phe 


Pro 


Asp 


Gly Ser He Ser Ser Ser Thr 


Lys 


Leu 


Val 






165 




170 




175 




Leu 


Val Asn Met 


Val 


Tyr 


Phe Lys Gly Gin Trp Asp Arg 


Glu 


Phe Lys 




180 






185 


190 






Lys 


Glu Asn Thr 


Lys 


Glu 


Glu Lys Phe Trp Met Asn Lys 


Ser 


Thr 


Ser 




195 






200 205 








Lys 


Ser Val Gin 


Met 


Met 


Thr Gin Ser His Ser Phe Ser 


Phe 


Thr 


Phe 




210 






215 220 








Leu 


Glu Asp Leu 


Gin 


Ala 


Lys He Leu Gly He Pro Tyr 


Lys 


Asn 


Asn 


225 






230 


235 




240 


Asp 


Leu Ser Met 


Phe 


Val 


Leu Leu Pro Asn Asp He Asp 


Gly Leu Glu 






245 




250 




255 




Lys 


lie lie Asp 


Lvs 


He 


Ser Pro Glu Lys Leu Val Glu 


Trp 


Thr 


Ser 




260 






265 


270 






Pro 


Gly His Met 


Glu 


Glu 


Arg Lys Val Asn Leu His Leu 


Pro 


Arg 


Phe 




275 






280 285 






Glu 


Val Glu Asp 


Ser 


Tyr 


Asp Leu Glu Ala Val Leu Ala 


Ala 


Met 


Gly 




290 






295 300 






Met 


Gly Asp Ala 


Phe 


Ser 


Glu His Lys Ala Asp Tyr Ser 


Gly Met 


Ser 


305 






310 


315 






320 


Ser 


Gly Ser Gly 


Leu 


Tyr 


Ala Gin Lys Phe Leu His Ser 


Ser 


Phe 


Val 






325 




330 




335 




Ala 


Val Thr Glu 


Glu 


Gly 


Thr Glu Ala Ala Ala Ala Thr 


Gly 


He 


Gly 




340 






345 


350 




Phe 


Thr Val Thr 


Ser 


Ala 


Pro Gly His Glu Asn Val His 


Cys 


Asn 


His 




355 






360 365 






Pro 


Phe Leu Phe 


Phe 


He 


Arg His Asn Glu Ser Asn Ser 


He 


Leu 


Phe 




370 






375 380 








Phe 


Gly Arg Phe 


Ser 


Ser 


Pro 








385 






390 











<210> 111 
<211> 1419 
<212> DNA 
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<213> Homo sapiens 
<400> 111 



ccagccacca ccgtctc?L aaaaaSga TatllltT '^^^'^ttcct agttcgttgo 60 
ggcgccgtca gcactcgact tggg?|?S cSScf^f aaatcatcat ggattcac?t 120 
ggcaacatct tcttttcccc t|?gggcSc SaJof ^ agctgaagaa aacaaatgat 180 
acccgaggag ccaccgcttc ccagSqoaa olaTjt!^ ^tggcatggt cctcctgggg 240 
agctcaagaa taaaggctga agaaaaaS^ SggtS^L' agagacgSg 3oS 

attgagaaca cagaagcagt acatcaaLa ?t?£a2at Itt^^^'^l^^ aggaaaagag 360 
tSJof ^ attatgaact gaacataacc aac^^g^rt aataagcaaa 420 

ttccttcaaa aatacttaga ttatgttgaa aaatf?^!?! *tggagaaaa aacatacctc 480 
gattttgtaa atgcagccga tgaaagtSa aaoaaa^fj atgcatctct ggaacctgtt 540 
acaaatgaaa aaatcaagga cttgt?cc?a gtSaolllt f"^=*9ggt tgaaagcaaa 600 
gtgctggtga acatggttta tttlaaaggg TataaaTr- "^9*«9<=tc taccaagctg 660 
actaaggaag agaaattttg gatgaa?!Ig TgTeSlat. ^'f ?"taa gaaagaaaa? 720 
cagagccatt cctttagctt Lc?ttccJg SgSaJ aao'''^^^^ ^atgatgaca 780 
Itnt acaacgacct aagcatgtt? SgStSac ^^^^^^^ tctagggatt 840 

gagaagataa tagataaaat aagtcctgag aaltSSf^ ccaacgacat cgatggcotg 900 
atggaagaaa gaaaggtgaa tctgcac?tg cccc!?Sta tccagggcat 960 

ctagaggcgg tcctggctgc catggggatg ITclltTj f^^^^^^ga cagttacgat 1020 
tactcgggaa tgtcgtcagg ctocggotto lltl^n ^^^S^Qagca caaagccgac 1080 
gtggcagtaa ctgaggaagg caccglgrt' gSSn T^^^^^ca cagttcc?tt 1140 
acatccgccc caggtcatga aaatgttc^ tarttl^ 2 ^^^gcatagg ctttactgtc 1200 
cacaatgaat ccaacagcat cctc?"??c lartn^tl^ ccttcctgtt cttcatcigg 1260 
tgccatggca ttgctgcttt taocSaJif 99cagatttt cttctcctta agatgatcqt 1320 
aaatcgtcca ttc\t?taaa tlSgS^ S^gca^^ ^'«^"^Sa llll 

<210> 112 ^^-^^ 
<211> 400 
<212> PRT 

<213>. Homo sapiens 



<400> 112 



..P Se, CX, „. v.. ... ^ ^ ^ 

..3 .lu U„ .J. ^„ „ 

Gly He L«o Thi il. lie gIv K.I: O.i , , '° 

JS' l." c^l, Thr ay 

^» ^» v.. PHe HX. oi„ .1. ^ 

3« ^, U. .y. Olu Olu Cl„ v,l ^, „, 

Glu Gly Lys Glu He Glu Asn Thr Glu Ala Jfi «■ ^, 8° 

Lys Phe Leu Thr Glu lie Spt- t„« t 95 

100 ^'^^ Asp Tyr Glu Leu Asn 
He Thr Asn Arg Leu Phe Glv ri., t 

Phe Gly Glu Lys Thr Tyr Leu Phe Leu Gin Lys 

Leu ASP Tyr Val Glu Lys Tyr Tyr H.s Ala Ser il^ gIu Pro Val 

H:rr""^^^S-------.3nSerTrp 

Val 0 u ser Lys Thr Asn Glu Lys lie Lys Is^ .eu Phe'p.o Asp 

Ser He Ser ser Ser Thr Lys Leu Val 12 v,i 

180 ^ ^al Asn Met Val Tyr Phe 

190 



wo 02/00174 



47 



PCTAJSOl/21065 



Lys Gly 

Lys Phe 
210 
Gin Ser 
225 

lie Leu 

Leu Pro 

Pro Glu 

Lys Val 
290 
Leu Glu 

305 

His Lys 

Gin Lys 

Glu Ala 

Gly His 
370 
His Asn 
385 



Gin Trp 
195 

Trp Met 

His Ser 

Gly He 

Asn Asp 
260 
Lys Leu 
275 

Asn Leu 
Ala Val 

Ala Asp 

Phe Leu 
340 
Ala Ala 
355 

Glu Asn 
Glu Ser 



Asp Arg 

Asn Lys 

Phe Ser 
230 
Pro Tyr 
245 

He Asp 
Val Glu 
His Leu 

Leu Ala 

310 
Tyr Ser 
325 

His Ser 

Ala Thr 

Val His 

Asn Ser 
390 



Glu Phe 
200 
Ser Thr 
215 

Phe Thr 

Lys Asn 

Gly Leu 

Trp Thr 
280 
Pro Arg 
295 

Ala Met 

Gly Met 

Ser Phe 

Gly He 
360 

Cys Asn 
375 

He Leu 



Lys Lys 

Ser Lys 

Phe Leu 

Asn Asp 
250 
Glu Lys 
265 

Ser Pro 

Phe Glu 

Gly Met 

Ser Ser 
330 
Val Ala 
345 

Gly Phe 
His Pro 
Phe Phe 



Glu Asn Thr 
205 

Ser Val Gin 

220 
Glu Asp Leu 
235 

Leu Ser Met 

He He Asp 

Gly His Met 
285 

Val Glu Asp 

300 
Gly Asp Ala 

315 

Gly Ser Gly 

Val Thr Glu 

Thr Val Thr 
365 

Phe Leu Phe 

380 
Gly Arg Phe 
395 



Lys Glu Glu 

Met Met Thr 

Gin Ala Lys 
240 

Phe Val Leu 

255 
Lys He Ser 
270 

Glu Glu Arg 
Ser Tyr Asp 

Phe Ser Glu 

320 

Leu Tyr Ala 

335 
Glu Gly Thr 
350 

Ser Ala Pro 

Phe He Arg 

Ser Ser Pro 
400 



<210> 113 
<211> 957 
<212> DNA 

<213> Homo sapiens 
<400> 113 

ctcgaccttc tctgcacagc ggatgaaccc tgagcagctg aagaccagaa aagccactat 60 
gactttctgc ttaattcagg agcttacagg attcttcaaa gagtgtgtcc agcatccttt 120 
gaaacatgag ttcttaccag cagaagcaga cctttacccc accacctcag cttcaacagc 180 
agcaggtgaa acaacccagc cagcctccac ctcaggaaat atttgttccc acaaccaagg 240 
agccatgcca ctcaaaggtt ccacaacctg gaaacacaaa gattccagag ccaggctgta 300 
ccaaggtccc tgagccaggc tgtaccaagg tccctgagcc aggttgtacc aaggtccctg 360 
agccaggatg taccaaggtc cctgagccag gttgtaccaa ggtccctgag ccaggctaca 420 
ccaaggtccc tgagccaggc agcatcaagg tccctgacca aggcttcatc aagtttcctg 480 
agccaggtgc catcaaagtt cctgagcaag gatacaccaa agttcctgtg ccaggctaca 540 
caaaggtacc agagccatgt ccttcaacgg tcactccagg cccagctcag cagaagacca 600 
agcagaagta atttggtgca cagacaagcc cttgagaagc caaccaccag atgctggaca 660 
ccctcttccc atctgtttct gtgtcttaat tgtctgtaga ccttgtaatc agtacattct 720 
caccccaagc catagtctct ctcttatttg tatcctaaaa atacggtact ataaagcttt 780 
tgttcacaca cactctgaag aatcctgtaa gcccctgaat taagcagaaa gtcttcatgg 840 
cttttctggt cttcggctgc tcagggttca tctgaagatt cgaatgaaaa gaaatgcatg 900 
tttcctgctc tgccctcatt aaattgcttt taattccaaa aaaaaaaaaa aaaaaaa 957 

<210> 114 
<211> 161 
<212> PRT 

<213> Homo sapiens 
<400> 114 

Met Ser Ser Tyr Gin Gin Lys Gin Thr Phe Thr Pro Pro Pro Gin Leu 
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em OXn Oin OXn Val ..3 CXn Pro Ser P.o P.o P.o om x.e 
Phe Val Pro .,3 Clu Pjo C^s His Se. val OIn Pro 

Oly Thr .ys Xle Pro OIu Pro OXy Cys .hr .,3 ^fx Pro «Xu Pro 
gy VaX Pro .Xu Pro OX, oy. H .^X Pro OXu Pro 

Oly C.3 T.r .,3 VaX Pro .Xu Pro .X. c.s rL .y. 

T.r .r .,3 VaX Pro OXu Pro OX, sL XXe ..3 VaX Pro 2p oXn 
Cly Phe ne ..3 P.e Pro OXu Pro OX^ Ma XXe ..3 VaX III ... 
Oly T,r .,3 VaX Pro VaX Pro OXy ... Jal Pro OXu Pro 

Cys Pro Ser Thr VaX Thr Pro GXy Pro AXa Gin r 

145 Ala GXn GXn Lys Thr Lys GXn 

Lys 155 



<210> 115 
<211> 506 
<212> DNA 

<213> Homo sapiens 
<220> 

<22X> misc_feature 

158rx?5,1iof'212?2X9' 22^6 'I'J^.t'' '''' '''' "0, 
263, 265, 275 304 1x7 321 ^58, 26X, ' 

380, 396, 450, Zi ' ^' ' 358, 371, 377, 

<223> n = A, T, C or G 

<400> 115 

'^TnToiii TtiiTiiii ^iii^z: r^T^^^ ^^"--^^ «° 

angtanagat gttctggata cca?tanSn JgcSccSS ^Lf ^^'f 99ttattctg 120 
tatgtaaatg gtatntcatt cgctactata antcaa^^ 9tcagaggct catattgtgt 180 
gaatantnng cagcncanct nanangrtg? cJ^^nSat? tttgggttat 240 

acancattgt aacctcnatc nagtgagaS naS^== ff"gtggtc atagcacctc 300 
tt.ccaaatgg nctcatntcn aa?g???aaa aajjant.r ^^^^^agtga tggctcanga 360 
atgttocacc aactagtacc tgtfaSS oacota^^f ■ ^^^taagaaa tacagactgg 420 
gactgtggta ncccgcatcg gSaaa aacacatctc ccttttccat 480 

<210> 116 
<211> 3079 
<212> DNA 

<213> Homo sapiens 



<400> 116 



TctiT,?, iiliiiiiii iii::Tt:i ^^^^^^^^^ ^--^^-^ 6o 

aasgaggtca aagtggttta Lgggggcjc tgfJqaSS ^^^^^gataa acctggcctt 120 
cttgcaggca gatctgccca gtgggcSg SaaS^ ccacattctc tggcctaaac 180 
tgtgcacaaa aggatgaaac Ictattttcc SSaqSS acaaaaaaat 240 

i:cc ctctagcaca taaccaagaa tataaggcta 300 
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cagattgcct ttcccagagg gaaaaccctg 
gcagatcact ggggaatcgt ttgccccccg 
caggtgctca gcatgtaccg tactgggatg 
ccaggacact gccatgccaa tgccccctca 
gccccagcct ctatggtgaa gacatacttg 
atcagtgctc gaaggcaagg ttatttctaa 
tgcaccccac accactgtgc aggtgtgacc 
ccagcccact taatcatcac agctcgacag 
taaaaagggg catcaccgtt cctgggtaac 
gttctctcca gcacctccca acccactagt 
ccaccatgtc tcgccagtca agtgtgtctt 
ccgcctctgc catcaccccg tctgtctccc 
ggggtggcgg tggtggtggc ttcggcaggg 
gctatggcag ccggagcctc tacaacctgg 
gtggtggcag cttcaggaac cggtttggtg 
gtggtgccgg tagtggattt ggtttcggcg 
gcggagctgg ctttggaggt ggcttcggtg 
gtatccaaga ggtcactgtc aaccagagtc 
ccagcatcca gagggtgagg accgaggagc 
ttgcctcctt catcgacaag gtgcggttcc 
agtggaccct gctgcaggag cagggcacca 
tcgagcagta catcaacaac ctcaggaggc 
gcctggactc agagctgaga aacatgcagg 
aggatgaaat caacaagcgt accactgctg 
tagatgctgc ctacatgaac aaggtggagc 
agattaactt catgaagatg ttctttgatg 
ctgacacctc agtggtcctc tccatggaca 
tcgctgaggt caaggcccag tatgaggaga 
cctggtatca gaccaagtat gaggagctgc 
tccgcaacac caagcatgag atctctgaga 
agattgacaa tgtcaagaaa cagtgcgcca 
agcgtgggga gctggccctc aaggatgcca 
tgcagaaggc caagcaggac atggcccggc 
ccaagctggc cctggacgtg gagatcgcca 
gcagactcag tggagaagga gttggaccag 
cctctggata tggcagtggc agtggctatg 
gcctcggtgg aggtcttgcc ggaggtagca 
gtgtcggcct aggtggtggg ctcagtgtgg 
gagggctggg ggtgggcttt ggcagtggcg 
ccaccacctc ctcctcccgg aagagcttca 
ttccaagtgc agcaacccag cccatggaga 
gttttatcct tttctggaga gtagtctaga 
ttcccaggag agccccattc ccagcccctg 
tcaaatcagc cttcaggttt cccacagcat 
tcccaaatct aaatcatcaa aacagaatcc 
taactacctc cagaatgtgt tcaataaaat 
gttttttttt tctacccaa 

<210> 117 
<211> 6921 
<212> DNA 

<213> Homo sapiens 



cagcaacctg ctgcctggaa aagtgtaaga 360 
ctgatggaca gcttccccaa gctccaaggg 420 
gttgtcaata ctcctggtcc tgtaagagtc 480 
gttcctggca tcctttttgg gctgctcaca 54 0 
ctagcagcgt caccaacttg ttgccaagag 600 
ctgagcagag cctgccagga agaaagcgtt 660 
ggtgagctca cagctgcccc ccaggcatgc 720 
ctctctcgcc cagcccagtt ctggaaggga 780 
agagccacct tctgcgtcct gctgagctct 84 0 
gcctggttct cttgctccac caggaacaag 900 
ccggagcggg gggcagtcgt agcttcagca 960 
gcaccagctt cacctccgtg tcccggtccg 1020 
tcagccttgc gggtgcttgt ggagtgggtg 1080 
ggggctccaa gaggatatcc atcagcacta 1140 
ctggtgctgg aggcggctat ggctttggag 1200 
gtggagctgg tggtggcttt gggctcggtg 12 60 
gccctggctt tcctgtctgc cctcctggag 1320 
tcctgactcc cctcaacctg caaatcgacc 1380 
gcgagcagat caagaccctc aacaataagt 14 40 
tggagcagca gaacaaggtt ctggaaacaa 1500 
agactgtgag gcagaacctg gagccgttgt 1560 
agctggacag catcgtgggg gaacggggcc 1620 
acctggtgga agacttcaag aacaagtatg 1680 
agaatgagtt tgtgatgctg aagaaggatg 17 40 
tggaggccaa ggttgatgca ctgatggatg 1800 
cggagctgtc ccagatgcag acgcatgtct 1860 
acaaccgcaa cctggacctg gatagcatca 1920 
ttgccaaccg cagccggaca gaagccgagt 1980 
agcagacagc tggccggcat ggcgatgacc 2040 
tgaaccggat gatccagagg ctgagagccg 2100 
atctgcagaa cgccattgcg gatgccgagc 2160 
ggaacaagct ggccgagctg gaggaggccc 2220 
tgctgcgtga gtaccaggag ctcatgaaca 2280 
cttaccgcaa gctgctggag ggcgaggaat 2340 
tcaacatctc tgttgtcaca agcagtgttt 2400 
gcggtggcct cggtggaggt cttggcggcg 2460 
gtggaagcta ctactccagc agcagtgggg 2520 
ggggctctgg cttcagtgca agcagtagcc 2580 
ggggtagcag ctccagcgtc aaatttgtct 2640 
agagctaaga acctgctgca agtcactgcc 2700 
ttgcctcttc taggcagttg ctcaagccat 27 60 
ccaagccaat tgcagaacca cattctttgg 2820 
gtctcccgtg ccgcagttct atattctgct 2880 
ggcccctgct gacacgagaa cccaaagttt 2940 
ccaccccaat cccaaatttt gttttggttc 3000 
gttttataat ataagctggt gtgcagaatt 3060 

307 9 



<400> 117 

gaattctgac tgtccactca aaacttctat 
aattgagata accatttaca aagacgatga 
taaatggaag gtcattagtc ctactgggaa 
cgttcctcca ccaaacaaag aagcggtgga 



tccgatcaaa gctatctgtg actacagaca 60 
atgtgttttg gcgaataact ctcatcgtgc 120 
tgaggctatg gtcccatctg tgtgcttcac 180 
ccttgccaac agaattgagc aacagtatca 240 
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300 
360 
420 



?:SSa1? cataaacat, aa.a.t,ta, tatcct.,ca 

gctacctggt gaacatcagc aag?Staag ?a2c?a«* f ^^^^^^^ taaagacLt 
ggaagatagc caggaatccc aagtcttS ^acT^T.l *f ^^^tttg aagattttct ..u 
ggttaatgta tgtaagcagt at?atcaaga ac??c??!ai f f tggaaaagga 480 
agaggaatca gtttataatc tctacatctc toaaa^^^ tctgcagaaa gagaggagca 540 
gaactgtgaa gatcggctga ttag^^agat Jcgaartcc^ ^^^^^^^^ ttcggttaga 600 
tgaaagtgtg ttcagaatca cagaacagga gaaJStaaa^ <=tggaaagag atgatttgca 660 
agatgatttg ggaacaatca caaataag?g ?gag2a?tt ^f^^T*^^ aacgacttaa 720 
ttcatcagtc cctaccctac gatcagagc? taJ?atao^^ ttcagtcaag cagcagcctc 780 
ctattctatg tcttccactt acataga^al g??J2a2t ^*^«ff^^ tgaaccaagt 840 
cactcaagct gcagaagccc tcgtaaaac? cta?aaaS tgttaaaaaa 900 

agttatagct gacaagaata atattgagaa ^cKataS f ^^^9^9*9 aagaagaagc 960 
tgaagtagat gaaaagagac aggta?tcca tgcctUof! ^^^^^^f^^^ aatggagatc 1020 
agccatcagt gatgaaatgt ttaaaacota lllTll^^ gatgagttgc agaaagctaa 1080 
caaagaaaaa gcagatcaS tagJtga Jg gtggSS ^t.TJT' ttgactggca luS 
caggttacgg gacttagagg gcattfgcaa ScSa« ^9^9° agattgacaa 1200 

tcctttagat gattggatcc agcaggttga TJ^T^l^l tactacagag acacttacca 1260 
gcctgaaaat agtaaaaccc tagcScaca rtta^Sf^ agaaagattc aggaaaatca 1320. 
aatagaaatg aaacagagca aaatgga^a SSaaaa ^^f^^^^'c tggtgtccga 1380 
tacagtgaag gactatgaat tacaaacaa? f£ll^l tatgcagaac agtactcagc 1440 
aaaatctcca gtgaaacgcc gaagaatgca gaS^oaa^f gocatggtag attcacaaca 1500 
catggaccta aggactcgat atactgc?c? g^Jctctc ^^^^"^ ttcaagagtt 1560 
tgctggtgat tcattgaaga gqctaaaao, 9gtcactctc atgacacaat atattaaatt 1620 
tgaacatggg gcatattcfg a?SS?S gcgtSaS^ "^^^59tgta aggagacttc iSS 
caaacttaca ggaaagataa gtgagttgga faSataat.^ gcaacagtgc ttgagaatag 1740 
gtcccgagta gaggaagaac ttccgaagS cJ^oJaaaof 9<=tgaactaa agaaacaaaa 1800 
gcagcagaga aatgtagaag atatctcS? gcaJaaaS f ^^aaaatg aattgagaaa I860 
gcagtaccgc agggaacttg aaaccattgt gaaaoaaa^^ ^gggctgaaa gtgaagccaa 1920 
ggagcgggtg aggcagctca ccatagajgc ll^l^ll^ gaagccgctg aaagagaact 1980 
cctcctgaat tttcgcaatc agttggS aaa?S?t f ^5°*9ccg tggaagagaa 2040 
tcatcttaaa agaaaagatt taagtctcaa ta^^f^^ aocagacgaa caotggaaga 2100 
ggaagaatta agaagaaaga gagacaatga • ggSa^Sc ^^^^^^ ataaattaat 2160 
ggaaaaagac cttgcatttc agaaacaaot lallT "saagctga taaagcagat 2220 
aattgaattg gaagcaagaa gaaaSSac tScaa iT^Tl^ ^aaagcajaa 2280 
attgccagtg tgtccgatca cacaggctac lUllTr. ^ tatacatgta gagaaaatgc 2340 
agaacatgac aagcagaaag cagaagSct llltl '''' 9cagtaacgg gtctccagL 2400 
caatagaaag gctgaacaag acItglSg^ gctg^Stat tlltt^T *^^-^9c?gc 2460 
tgaaaaaacg tcatctgagg aaaaaqctca t^;^^^ 9aacttaatg ccctccagct 2520 
taatacactc agatgcct?a agttggagS ggaSaao ^^^^^^^^^ atgaaacLa 2580 
ttctcaacaa ctcagagagc ttggtaSca lllallt^ gatcaggcgg agaaagggta 2640 
agccatgcaa gaagctagtg atctcafaaa accacaggta aagctgaaga 2700 

atctcttaat catgaaaaag ggaaactaJa lllT^T ^^^^^^cagt tagaa?taga 2760 
tgctgtagct gagaagaatf ttSgcattt aaSfcf ^ ^acagaatca caagggcaL 2820 
gaaagaatta gaaagactac aaatctgcca T^lV^llT ^"^^^ctt ttcgagatga 2880 
tgagaaaagc catgagcagt tgcttcJaaa StSaaaoct ^^^"^"""^^^ aagaacaalt 2940 
aaaa^r^^' ctcaatgaag aattggagaa aagtaatoa^ ?::^^^9aaa ataatgataa 3000 
aaaagtagag gagcttacta ggcagaataa Ta f^lTr.^^ tgtgcagaga tgctaaaaca 3060 
lllltll^''^ gagaatatag ^tttagagaj aSJacStc ""^^9atgc agagaattca 3120 
gaaaattcag gcagatggtt ttaaaga?ca gSacaJ^^. °^9caaagat gtgaagcact 3180 
imiT^^ acagagcagg attttcaaag aaaSta!^ acaaatgaac acttgcataa 3240 
gaaaagtcaa aatttggtaa gtgaatttaa TclllUl l ^gcctagaag aagacctggc 3300 
ccagaatacc aagaaagaag ?tagaaaS gaJtacf ^ gaccaacaga acattatcat 3360 
gaagcgacgc ggggagcaga aag?tcagS acaa^f ctgaatgctt ccaaagaaga 3420 
caggttgaaa aaagtacaag acgaattaS lltlT.T^ '^aggtgcaag agttaaatfa 3480 
cagaaagatg gttctgtttc aggaagJatc llllllllll ^*^9aggagc agatgaccca 3540 
tcggaagaag atggaaaaat taatggagtc cSaot^=J ^^t'^^^^ cagaggagtt 3600 
cattaggctt gactttgtgt ctctlcafca ag^ScSt aaa^^^*^ atatttc^gg 3660 

a aaactct agagcccaag aaaatgctaa 3720 
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gctttgtgaa 
gcagcaaggg 
gctgatagcc 
agagcatgaa 
agactgtacc 
gctgtcctct 
gactcaagaa 
caaagaagtc 
ttactctgag 
ccccattaca 
accacctgtt 
cttagagata 
agaaaagaac 
tggactcaag 
tgatggtgat 
cactgtgact 
gctgcgactc 
gaaagccacc 
ctcagcggcc 
ggctgcaaca 
agttcttaaa 
agctgtggga 
aatgcttgac 
cattgaccct 
gaataatgcc 
tcccaataac 
agagtcccaa 
gaaaacacat 
ggctttccag 
tcagtggaag 
tactaaaaca 
agccttggtc 
gctgagccgg 
tgctagtggg 
tactgccctc 
tggcaaaaag 
ccagcagctg 
aatgatgtca 
atgtttggaa 
atcaatagaa 
agatcaaaag 
taaagaagcc 
atctgagccc 
taaataactg 
tatcggctac 
attgctaagt 
cttcaaatgg 
tttttttttt 
tttcttcaga 
ctgaaagagc 
cagccttcaa 
gatatagtga 
tcttcgcact 
ttcattctgt 



acaaacatta 
cagcacatgg 
cagaagcgtg 
catcaattag 
ttcaaaccag 
agaaacactg 
ccacagccat 
caattccagc 
tacttttctc 
agactgtctg 
aggtatcaag 
gctaagaaca 
ccagttccca 
aaaggggatt 
catgcatgtt 
gccaggcagt 
ggtcttaaga 
tcaattgcag 
gagagaatca 
ggttttataa 
ggagttgttg 
tattcttatt 
agacaaaaag 
gtgagaggca 
atcttacagt 
aagcaagctc 
tgctttctgt 
agaatttctg 
agaaacctga 
gaagctatgt 
ggattacact 
aaaaagtatc 
ttagtcccca 
gaaaggatct 
cgatgccttg 
taccgggtgg 
cgacagtgtg 
gtggtggaag 
tttcagtact 
gaggctctcc 
tcatatgtca 
ttagaaaaag 
ctgatgacag 
tgcaaggggt 
atatgcagtc 
gctcaaaata 
tttcatttag 
tttacgtaga 
actccccttc 
acgtcatgaa 
gctgcagtgt 
ccttctttgc 
tgaaagctaa 
gtattttccg 



aagaacttga 
aagcaaatca 
aggttgaaaa 
ttttgctcca 
attttgagat 
gacaccttca 
tggaagagaa 
caccaggggc 
agacaagcac 
aaattgagaa 
ataacgcatg 
agcagtatga 
gtgctgaaga 
tccttaagaa 
cagtcaggga 
tggtggaagc 
ctgttgaaga 
ggctttacct 
taatagacaa 
ttgatcccat 
accccgaatt 
cttctaagac 
gtaaacatat 
ttcgtgttcc 
ttttacatga 
tgtattactc 
ttccatttgg 
tagtagatac 
ttgagaaaag 
tttttgaatc 
tcaatattaa 
aggaaggcct 
agaaagattt 
ctgtactaaa 
aagcccaagt 
ccgaagcttt 
aattagtaat 
ctgtgaatgc 
tgacaggagg 
aagtaggtat 
gaaatataat 
ctgattttga 
gaatttctag 
gatgcaggct 
tgtgaattat 
gagtaagttt 
ccttgagaat 
atgtgggata 
attgaatagt 
gcaccatgga 
tttagattgc 
atattaaaat 
cattatgaat 



aagacagctt 
ttaccaaaaa 
cctgaagcaa 
gtgtgaaatt 
gacagtgaag 
cccaacaccc 
gtggcagcat 
tccactcgag 
cgagttacag 
gataagagac 
tgaaatggaa 
tatgcataca 
atggatgctt 
gggcttagaa 
tgatgaattt 
taagcttctg 
agttcagaaa 
agaatctaca 
aatggtggct 
ttcaggtcag 
cagaattagg 
attgtcagtg 
cttggaagcc 
tccagaaatt 
gccatccagc 
agaattactg 
ggagaggaac 
taaaacagga 
tatatatctt 
ctatgggcat 
tgaggctata 
catcacactt 
gcacagtcct 
agcctcccgt 
cagtacaggg 
gcatagaggc 
cacagggatt 
aaatattata 
gttgatagag 
tatagatgtc 
atgccctcag 
tttccacaca 
cctctactat 
ggttcatgcc 
gtaacatact 
taaattgaaa 
ggttttttga 
aacttgatga 
gatcatttat 
atcaaagaga 
ttcaaaaaat 
gtttaccaca 
attatgtgtt 



caacagtatc 

tgtcagaaac 
aaaatggacc 
caaaaaaaga 
gagtgccagc 
agatcccctc 
cgggttgttg 
aaagagaaaa 
ataacttttg 
caagccctga 
ctggtgaagg 
gaagtcacaa 
gaagggtgca 
ccagagacct 
aaattccaag 
gacatgagaa 
actcttaaca 
aaagaaaaga 
ttggcatttt 
acatattctg 
cttcttgagg 
tttcaagcta 
cagattgcca 
gctctgcagc 
aacacaagag 
cgaatgtgtg 
atttccaatc 
tcagaattga 
gaactttcag 
tcttctcata 
gagcagggaa 
acagaacttg 
gttgcagggt 
agaaatttgg 
ggcataattg 
ctggttgatg 
ggccatccca 
aataaggaaa 
ccacaggttc 
ctcattgcca 
acaaaaagaa 
ggacttaaac 
tcttcctaat 
actttttcag 
ctatttcttg 
attacataag 
aacttggcca 
actccaagtt 
taaatgataa 
aagatataaa 
gaaaaagttt 
atgtcccatt 
ggaggagggg 



gtgaacaaat 
ttgaggatga 
aacagatcaa 
gcacagccaa 
actctggaga 
tgttgagatg 
aacagatacc 
gccagcagtg 
atgagacaaa 
acaattctag 
ttttgacacc 
cattaaaaca 
gagcatctgg 
tccagaactt 
ggcttaggca 
caattgagca 
agtttctgac 
tttcatttgc 
tagaagctca 
ttgaagatgc 
cagagaaggc 
tggaaaatag 
gtgggggtgt 
aggggttgtt 
ttttccctaa 
tatttgatgt 
tcaatgtcaa 
ccgtgtatga 
ggcagcaata 
tgctgactga 
caattgacaa 
ctgattcttt 
attggctgac 
ttgatcggat 
atcctcttac 
aggggtttgc 
tcactaacaa 
tgggaatccg 
actctcggtt 
caaaactcaa 
agttgacata 
tgttagaagt 
gggacatgtt 
agtatgatga 
agggctgcaa 
atttaatgcc 
cactaaaatg 
cacagtgtca 
attgcactcg 
ttcgttccca 
tgcctttttc 
tctagttaag 
aaggattttc 



3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4 620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 , 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6921 
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<211> 946 
<212> DNA 
<213> Homo sapiens 



<400> 118 



IpcTol ZZllll Tctllf^Z ----^ --ctt.t. tcctt.tccc eo 
gtcggcagtg gcttaggcct gggtggagga aq^octf ^ 9ctatggcgg tgccagtggt 120 
gttggaggtg gcttcagttc cagcagtggc TalaaT.t^ cctatggcag tggtcttggc 180 
ggaggcggca gttccaccat caagtacJcc IZt t I Mggtggcct cagotctgtt 240 
aagcactaaa gtgcgtctgc tag^t^t^gg ScacaaS ^^''^^^^^^ gaagagcLt 3oS 
cagagccctc tcctcaggtt occtai-r^n^^ ^cccacagtc ctcaggcccc tctctqacta 3fin 
ggtagagctg gggatgStg ?t2Sgccc tcSr^.*" ^^^'^^^^^ tgctg?Sf 420 
gagcacccat tgctcacca? cagaSacc ^c?g2Sta tataccatct 48? 

tggagcttca ctgttactaa attattaatt ^cS^^Jn "^^^^f^^atg taatcaccac 540 
gagcattata agaaaatgac ctctgctcS t^clTalt "^'^^^^^t ctctgaggct 600 
ttcagaacaa cttccactta ctttccactg gctcSafac f f^f"S°<= aggggcttat 660 
tgaaccccca cccaggcagt atccsi-n^Jl gc^^ctcaaac tctctaactt ataagtqtto 720 
gcctgtatct ctgtStgJt' tlT.Zto i^a'St'S T'^^'^ gatgSSS 7 o 
acJJf^^f f catatattct tttact^tgc ctS^aa ^^^^"^cta aataaagcag 840 
acttttttat ctgataagtg aatagttg?t tSSaga? aaSr^'' ttgggcttaa 900 
<210> 119 
<211> 8948 
<212> DNA 
<213> Hcmo sapiens 

<400> 119 

a= ^?g-S ---c cacccgaeoa 60 

5 SggS g^-— S--- - 

gtccgcctat ccttggcccc ctccgS??? Jc^acaf^'^ tcccgcccgg ttccccggc? 240 
gcgctgagcc gctctcccga ttgcccocca ^^f^^ cccgcctcgc ttatgcctcg 300 
ggatoaacac tctgggc^o atlScgcg ccSt'cta^ — gg-g^c tcccLccg? 36°S 
ac™!T ^^^'=9^999'= accagcagg! ?g?;Sa^?c ^"^^^-^^^^ cgctacgagg 420 
accagaactc ggacggctac tgtcaaaccg acacaaS. tcggcgcggc gtgatcaccg 480 
ccatccagga gctgctgcag aactgctcca acS^^ f caggcaccag aaccagaaca 540 
agcctgaatt. gaagtatgga gatqStao f!^? g^gagcagag ctcatcgtgc 600 

gttttgccca ggccaatSc caaa?ggaa: tt^olTcll \T ttggatga^ 660 
agatgggcca gccctgtoat aei-fa^^I T:cctcgacag cttgatcaga gagatocoae 79o 
gagcccttta taaag^catc iTgtcoTc ^^^f tcagctccaa Ulllllll Vso 
a'St\T' "gtcagagt g^t'Sggct g'g^S? .T^^^^^^^^ aagggtgg?g 840 
aatgtttggg gtggatgagg caocaaaaar^ gggatgagtt caccaaacat gtcaccaqtq 900 

tllltllT ciiTttTa glZltlZ ^l'^^^-- Sgggtgt'S tlo 

actatcgctg gcagctggac aaaatcaaag coZZlatt tccatcggcg 1020 

agttggagga ggagtatgaa aacctgctgl aagcStS ^^^^^^^^^^^ gcgatctacc 1080 
gacagctgca gaacatcatt caggccacqt ™^ ^ tgagaggatg gatcacctgc II40 
aggaggagga gctgctgtac gac^ggagS l^tT^^^ '^atgtggatc aatgactgcg 12o2 
aggccttctc catacgcatg agtcfacfgg aaqttaafaf "^^^^^^^ct cagaaacagg 126^ 
aaoaagaaag tgaccaaott gtcctcaatc aao!f ^ aaaagagctc aataagctga 1320 
atatggacac tctgcagacg SgtggaSt gg'attctS: "^'^'^"^^ attga^gcS 1380 
catao^i^^^ ^gaaaatgct gcctact?tc frttm^^J gatcaccaag tgcattgatg 1440 
catacctgaa ggggctccag gactccatca aaZlll t agaggcgcag tctactgaag 1500 
SSf "t^ctgga^ cagatcaagj SaSa f^r^"""" -gaacftg? 1560 

s »s a~ ~ s 

aaoaagatca gaaaatcgtg eata.ggggg ™2 ~tc .g.ga^^^^^^^^ ,,,, 
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gcagcaagtg gtacgtgacg ggcccgggag 
tgatcatccc tcctccgaac ccactggccg 
acgaagccat cttggctctg tggaaccagc 
ggcactactg catgattgac atagagaaga 
caatgcggca ggaagattac atgaagacga 
tcatcagaaa tagccaaggc tcagagatgt 
ctcagttcac cgatgcccag aagcattacc 
cccagcacca gacagtgacc acaactgaaa 
accataataa agtaattgaa accaacagag 
tggagctgca gaagattcgc aggcagatag 
acctccctct agcagaccag gggtcttctc 
agagtgtgca gaatgattca caagcaattg 
ttgccaactt cagaggttct gaaaagtact 
ttcagaaact ggaaaatatc aatggtgtta 
taagggcact gctccaggct attctccaaa 
ggctcactga ggaggaaact gtctgcctgg 
gactgaagaa aataaaaaat gacttgaact 
cagaactaca gaaagcccag cagatccact 
atctggactt gggcaagttc ggtgaaaaag 
tagataaaca gatcgacttt agattatggg 
attatcgtga taactatcag gctttctgca 
attccttaga atccatgaaa tttggagatt 
agaagaactt gcacagtgaa atatctggca 
ttgctgaact ttgcgccaat tcaattaagg 
caggactgga aactctgctg aacataccta 
gggtgattct gcaagaggct gcagatgttc 
ctggagacta ttacaggttc ttaagtgaga 
aaaataccaa gatcgaagtt ttggaagagg 
aaaactgtaa taagaacaaa ttcctggatc 
cccagttcaa agcgaagctt gcgagcctgg 
ggaagtcggc taagcaaaat ctagacaagt 
agatcacccg actgacttat gagattgaag 
acagatttga ccaacagaag aatgactatg 
aggagaacct tggttggcag aaattagagt 
agattgaaag gttgagggtt ctactgcagg 
atgagctggc aaaggtaaga aaccactata 
atgaaacaga gattaacatt acgaagacca 
atgattccaa aaatcttaga aaccagcttg 
aggatgaaat tgtcaggctc aatgacagca 
ctgaagaaaa cgcccttcag caaaaggcct 
atctggagat agaactgaag caggtcatgc 
agcagtccct ggaggaggct gccaagacca 
tcaaagctga gtttcaggag gaggccaagc 
aggtaagaaa caattatgat gaggagatca 
tcaacatcac caagaccacc atccaccagc 
gctaccgggc tcagatagac aatctcaccc 
agaggctgaa gaacactcta acccagacca 
tccaacagca aaaggccact ggctctgagg 
agctgagaca agtcactcag atgcgaacag 
atgatgctgc caaaaccatc caggataaaa 
tcgacaaaga aacaaatgac cggaaatgcc 
tccagtatga cctgcagaaa gcaaacagta 
ttcaggagca agaactgaca cgcctgagga 
ctgtgaagga ccaggatatc acgcggttcc 
agcagaaggt ggaagaggag ctgaatcggc 
agaggaagaa gctggaggaa gagctggaag 
tcaaaatcac caacctgacc cagcagctgg 
aggatgacct ccggcagcag agggacgtgc 



gcgttgacat gcttgttccc tctgtggggc 1860 
tggacctctc ttgcaagatt gagcagtact 1920 
tctacatcaa catgaagagc ctggtgtcct 1980 
tcagggccat gacaatcgcc aagctgaaaa 2040 
tagccgacct tgagttacat taccaagagt 2100 
ttggagatga tgacaagcgg aaaatacagt 2160 
agaccctggt cattcagctc cctggctatc 2220 
tcactcatca tggaacctgc caagatgtca 2280 
aaaatgacaa gcaagaaaca tggatgctga 2340 
agcactgcga gggcaggatg actctcaaaa 2400 
accacatcac agtgaaaatt aacgagctta 2460 
ctgaggttct caaccagctt aaagatatgc 2520 
gctatttaca gaatgaagta tttggactat 2580 
cagatggcta cttaaatagc ttatgcacag 2640 
cagaagacat gttaaaggtt tatgaagcca 2700 
acctggataa agtggaagct taccgctgtg 2760 
tgaagaagtc gttgttggcc actatgaaga 2820 
ctcagacttc acagcagtat ccactttatg 2880 
tcacacagct gacagaccgc tggcaaagga 2940 
acctggagaa acaaatcaag caattgagga 3000 
agtggctcta tgatcgtaaa cgccgccagg 3060 
ccaacacagt catgcggttt ttgaatgagc 3120 
aacgagacaa atcagaggaa gtacaaaaaa 3180 
attatgagct ccagctggcc tcatacacct 3240 
tcaagaggac catgattcag tccccttctg 3300 
atgctcggta cattgaacta cttacaagat 3360 
tgctgaagag tttggaagat ctgaagctga 3420 
agctcagact ggcccgagat gccaactcgg 3480 
agaacctgca gaaataccag gcagagtgtt 3540 
aggagctgaa gagacaggct gagctggatg 3600 
gctacggcca aataaaagaa ctcaatgaga 3660 
atgaaaagag aagaagaaaa tctgtggaag 3720 
accaactgca gaaagcaagg caatgtgaaa 3780 
ctgagaaagc catcaaggag aaggagtacg 3840 
aagaaggcac ccggaagaga gaatatgaaa 3900 
atgaggagat gagtaattta aggaacaagt 3960 
ccatcaagga gatatccatg caaaaagagg 4020 
atagactttc aagggaaaat cgagatctga 4080 
tcttgcaggc cactgagcag cgaaggcgag 4140 
gtggctctga gataatgcag aagaagcagc 4200 
agcagcgctc tgaggacaat gcccggcaca 4260 
ttcaggacaa aaataaggag atcgagagac 4320 
gccgctggga atatgaaaat gaactgagta 4380 
ttagcttaaa aaatcagttt gagaccgaga 4440 
tcaccatgca gaaggaagag gataccagtg 4500 
gagaaaacag gagcttatct gaagaaataa 4560 
cagagaatct caggagggtg gaagaagaca 4620 
tgtctcagag gaaacagcag ctggaggttg 4 680 
aggagagcgt aagatataag caatctcttg 4740 
acaaggagat agaaaggtta aaacaactga 4800 
tggaagatga aaacgcgaga ttacaaaggg 4860 
gtgcgacgga gacaataaac aaactgaagg 49*20 
tcgactatga aagggtttcc caggagagga 4980 
agaactctct gaaagagctg cagctgcaga 5040 
tgaagaggac cgcgtcagaa gactcctgca 5100 
gcatgaggag gtcgctgaag gagcaagcca 5160 
agcaggcatc cattgttaag aagaggagtg 5220 
tggatggcca cctgagggaa aagcagagga 5280 
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tixtiiii iiZTcTot iiiitiitr ^'^^^^^^^^^ 

aagataaaag cagaagctta aatgaaagca aaa?agaaf? ?S ""'^f ' aaggcgatag 5400 
cagagaacct gaccaaggag cac?tga?g? taoaaSaaa J^^^^^gctg cagtctctca 5460 
agtacgatga cctgaggaga ggacgaaaS aao™^! f^^^^^gaac ctgaggctgg 5520 
tggaactaag gagccalcig caga?S aSacIaa=^ tgataaaaat gcaaccatct 5580 
ttaatgattt acagagagag agggaSftt t^nT^^ cctggaactg caggggctga 5640 
aggctttaga ggca\?tL? a?Stt"1g aa'tLaaga'a Saaf f ttccaaaa^c 5700 
aggaaagaga gagccttctg gtgaaaatca laatnltnnt ^^^^^^tact caggtggtac 5760 
agaggctgga ggatgagctg aaLgSSaJ iTcTclll IT.T°^"^ gcaaggctgc 5820 
aacagcgcct ggagtgtgag aaacagcaaa tSgSga Satca^ f'^^^^^tga 5880 
aatattcccg caaggaggag gctattagga agataoaatc aSff ^ tggaagactc 5940 
gagagaagaa cagtcttagg agtqaaatca ttllt^t ^^^^^^agaa aagagtgaga 6000 
aagagaggtg caggogtafg SgJ^S SaccS^aa IIT'^T" ^^^^^^-tt'g 6060 
cagaacgctc ccgatatcag ag||agattq ItlllTr-Tn f ^^''^^tca cagttagaaa 6120 
atcgagagac ccagactga? t^gagtgga ccS?2oac ^.^^^"^ccca tatgggtccc 6180 
ggctgaggaa gaaggtgaca gcaatgigc tctTtlTJn f ^^^^f^ctg gtgtttgatg 6240 
ccttggacaa actattgaag gggaaa^aat Itlt^^^ tcagctgatc gacaaaacaa 6300 
cattccttcg gggtgcaggl iTatTgTg glfclilTc ^-aatccagc 6360 

rtttggtaga ggccaagaga aagaaaSi ?SaccSof gaaaaatact 6420 

aggcccaggc agctacaggt gg?a?Jatta aSro^ff^ atccacagtc atgcttctgg 6480 
acagtgccat agctcgggac c?cattaac? t^tt ^ gaatgagaag ctgactgtcg 6540 
aaaaagctat cactggmt gatg^^ccaJ Sc^^'"''^ tcagcagata tatgcagcag 6600 
ccatcaagaa aaattigatt gatfoaoaaa gacagtatct gtttcagaag 6660 

cttcaggggg tgtagtagac ?Sg?gaaca gtS?t? acc"''"""^ g-cagattg 6720 
cccgggggct gattgataga gat?tgtatc SccSa^^ f^'^^^^^^at gtcgccttgg 6780 
aaaactttgt ggatccagtc accaaaaaga aaotcS^= tgatccccga gatagtcaga 6840 
gcagaatcga accacatact ggtctgc??? toStS^f "'tgcagctg aaggaacggt 6900 
tccaaggaat cagacaacct g^gaccSca c?aaacS2 t'^ftT^^ agcatgtcct 6960 
cgtccactgt caatgaactg gaatctgrtS aSSff ^^^"ctggt atattgagac 7020 
ttaaggactt cctccaggg? ^ca^gc^Sa tScaaa^!^ ^gaggtt ggtgagagaa 7080 
agaagcttgg catttatgag gccaSaa ItaarffTt ^^^'^^^tgag accacaaaac 7140 
agttgctgga agcccaa|cl gctaSS ?f ccgacctggt actgctctgg 7200 
taccagtgga ggaagccLc aagaSg^^ latlTalll iTfl^'^'' aacttgaggt 7260 
tgtctgcaga acgagctgtc acLgqtata ataaf^o? tgagttcaaa gagaagctco 7320 
tgttccaagc catgaataag gaaS^Jtco ll^nnT^^ aacaggaaac atcatctctt 7380 
cacagatcgc aaccgggggg Itcat^Scc alTJJ.t °^ cggtattcgc ttattagaag 7440 
tagcatataa gagg^gt'!? ttca^tSgg aaS« a^Scf "="^^=9aca 7500 
atgataccaa aggatttttt gaccccaaca rtSaaSf^ f^l l"^ gatccaagtg 7560 
aagaaagatg cattaaggat ?aggaaacao a^ltlf^fit ^'^^^^'^^^^ ctgcaactaa 7620 
agaaacaggt gcagaca?ca LaaaS cScaSaa aSf'^^^ aaagaaaaga 7680 
acccagaaac caataaagaa atgtctgttc ttllaln^^ 9.cgtagagtg gtcatagttg 7740 
atgaaacctt caaagaactg tgSSSaa tJlf^ t caagaagggc ctaattgatt 7800 
gatcagatgg ctccaccagg g?ggtccSa lllTtlatll ^^^^^aaata accatcacgg 7860 
ttcaagatgc tattgacaag ggcS?Sg IctgTalott Itlt^T^"" ^^gtatgata 7920 
gcagoctcag cctcactcaa tttactoaca tntttt^ll ctttgatcag taccgatccg 7980 
gcagcagcat gggcagtggt gtca'gcStg ItlTtltll llT.tT"''' ^'^^^cacca 8040 
taagtaagat ttccaccata tccaocotca aa«^^^ ^ cagctcccga catgaatcag 8100 
cagacaccct ggaagaatcg agcScSS SaS^J^^? cataaggagc agctctttt? 8160 
aaatctccat tacagaagg? a^Jgagcg^g gSSStoa cTa'cT''' aacctggaga 8220 
ttctggaggc tcaggcctgc acaggtgica cagcatcacg ggtcagaggc 8280 

cacttcagga cgcagtctcc cagggtSqa ^ta!c™= ^^^^^^9ggc cagaagctgt 8340 
ctgctcagaa agccttcata ggc??caaoa J^ltr.^ catggccacc agcgtgaagc 8400 
cagaggcagt gaaagaaaaa tggctcSg? SSSa "^^^^^^^9 atgtcagcag 8460 
agtacctcac gggaggtctt g??gacccgg S2?oc2«! ^cgcttc ctggagttcc 8520 
ccatccggaa ggggttcata gatgjccgcg ccqcacaafa ^^^^ataagc accgaagaag 8580 
atgccaaaat cctgacctgc ccclaaaccf allH^tJ ^^^f^^^ accagcagct 8640 
atcgctccat ggtagaagit atcac^gggc tlmtll TatHTr."^^ gatgccataa 8700 

sggc tgcgccttct ggaagccgcc tccgtgtcgt 8760 
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ccaagggctt acccagccct tacaacatgt 
gctcgggatc tcgctccgga tctcgctccg 
ttgacgccac agggaattct tcctactctt 
ggcactag 



cttcggctcc ggggtcccgc tccggctccc 8820 

ggtcccgcag tgggtcccgg agaggaagct 8880 
attcctactc atttagcagt agttctattg 8940 

8948 



<210> 120 
<2n> 587 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 91, 131, 256, 263, 332, 392, 400, 403, 461, 496, 497, 499, 
510, 511, 518, 519, 539, 554, 560, 576 
<223> n A,T,C or G 

<400> 120 

cgtcctaagc acttagacta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 
gcttatgttt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 120 
gggctgtgca ntccggtcag ggcgggaagg gaaatgcacc gctgcatgtg aacttacagc 180 
ccaggcggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 240 
cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 
ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
gacactgccc attccctctc agggcagctc angtcacccn ggnctcttga acccagcctg 420 
ttcctttgaa aaagggcaaa actgaaaagg gcttttccta naaaaagaaa aaccagggaa 480 
ctttgccagg gcttcnntnt taccaaaacn ncttctcrmg gatttttaat tccccattng 540 
gcctccactt accnggggcn atgccccaaa attaanaatt tcccatc 587 

<210> 121 
<211> 619 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 260, 527, 560, 564, 566, 585, 599 
<223> n = A,T,C or G 

<400> 121 

cactagtagg atagaaacac 
gcctaaccca ggttaactgc 
tgcataaagc caatgtagtc 
tcaatacaca ctcatgaact 
tgcacacttg ctagactcan 
gacaacctac tttgcttggc 
gacatttagt tagtgctttt 
tttccaaatt tttgtacagt 
aatgaagtcc ctggtttttc 
cttaaaacat ctactatatn 
aagtggtggg gaaaaaaaa 

<210> 122 
<211> 1475 
<212> DNA 
<213> Homo sapiens 



tgtgtcccga gagtaaggag 
aagaagaggc gggatacttt 
cagtttctaa gatcatgttc 
cctgatggaa caataacagg 
aaaaaatact actctcataa 
tgagtgaagg aatgatattc 
tatataccag gcatgatgct 
cgctgcacat atttgaaatc 
atggcaactt gatcagtaaa 
gttnanatga aattcctttt 



agaagctact attgattaga 60 
cagctttcca tgtaactgta 120 
caagctaact gaatcccact 180 
cccaagcctg tggtatgatg 240 
atgggtggga gtattttggt 300 
atatattcat ttattccatg 360 
gagtgacact cttgtgtata 420 
atatattaag acttccaaaa 480 
ggattcncct ctgtttggta 540 
ccccncctcc cgaaaaaana 600 

619 
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iii If iS£ ss ai? s=s ?. 

caaagaaatt Saaaa^!'*^ tccaacaagt acttctccJa Stnfnf^ tgtgactgtc 180 

HtF- ir-- '"-IS 

~ sss Igs - 

S" s i "1 - »~ L'S 

cggcSSa 5^"^tt gggggagaat tcaccaccS Sf ^^'^^^^ aagactctga 600 
tcS^^ttg ctS^T °995ggggct ctgtcaccta Tt'JT^^ <=«tggtttg 660 
actaca?cot o^f'^!^^*^ agcgccacac actgcttca? S^f ggcagcctca 720 
ttgagSga LJccSf ttaactccaa cacgcaa^:: "^^^^^^^^g 780 

alSc^^Sg^ iti:ir.T' '^^^ — 

'tgactott^? ctttggaaaa gagaattcta ccgactal-r-f ^^^^tttggc acaagctgtg 1020 
aagtca^cfc' IT.llTctl ^^^-^^^ agt'^calcl ToTotlT' r'^^^-^' ^080 

SS i# SS lis- iii? Si ~ 
= iSiS 5= :S5 5~ H 

<210> 123 ^^'75 
<211> 2294 
<212> DNA 

<213> Homo sapiens 

<400> 123 

cagcgccggc tcgcgccctc ri-nr. 

f« siiS ,~ » 

^So-^SS 2~ - 
SIS f= i5« S'S s-i 

s:£i Si- s» -sH ~- ss?,-* .'.^s 

.tt.ttS SttS; •"'■=•"5' 99cc..„5l 5« 

=1 ~i 'sd' ?^r-- - 

g^gatcagcg ccacacactg cttcati-^^^ ^^^ggaggca gcctcatcag cccttacl-^^ 7on 

cSaS:K -^^---S X'ggai: iT'jr' -^-"'S' 78? 

cSScc :,^^^9gacta cagcgctgac acgStgctc !f ^^"^^ ggtggaaaac 840 

tgcctg'^cc^^ UllZti: IIT'"^'' 9c?cagS L^^^JJ^t 2^*=^^*^^ 
ggaaaagaqa a^f^r °9atocccag tttggcacaa ac^t^ 1 acagaccatc 960 

ctgatttccc ac^^^''!?^ ctatctctat ccggagcagc ta!f ^f^^*' «ctggcttt 1020 
atgctgtgtg ctalf^^^^ tcagcagccc cactactacg Ic^T^^^ tgttgtgaag 1080 
cccctcgtc? SSctcf ^^^^99^-- acagattS gcca!Sf ' ''r^^^'^"^^ ^^^^ 
ggatgtlccc ?aiaaLr^ aggccgcatg actttgactg oaaS^^ '^tcaggggga 1200 
atcclclg^c acacof g<=caggcgtc tacac|agag tctJ=^!'f5 ^^^^ggg^cgt 1260 
cgggLcSc ccgctttctt llt'T^^'' ^99clc?o! ga'ggScc f!^^^-*^^ "20 
taagaagagc .ggU^al^ --.ca^ -~ -0 

^ gcctgtgcca ccaccagggc 1500 
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gaacgacaat agctttaccc tcaggcatag gcctgggtgc tggctgccca gacccctctg 1560 
gccaggatgg aggggtggtc ctgactcaac atgttactga ccagcaactt gtctttttct 1620 
ggactgaagc ctgcaggagt taaaaagggc agggcatctc ctgtgcatgg gctcgaaggg 1680 
agagccagct cccccgaccg gtgggcattt gtgaggccca tggttgagaa atgaataatt 1740 
tcccaattag gaagtgtaag cagctgaggt ctcttgaggg agcttagcca atgtgggagc 1800 
agcggtttgg ggagcagaga cactaacgac ttcagggcag ggctctgata ttccatgaat 1860 
gtatcaggaa atatatatgt gtgtgtatgt ttgcacactt gtgtgtgggc tgtgagtgta 1920 
agtgtgagta agagctggtg tctgattgtt aagtctaaat atttccttaa actgtgtgga 1980 
ctgtgatgcc acacagagtg gtctttctgg agaggttata ggtcactcct ggggcctctt 2040 
gggtccccca cgtgacagtg cctgggaatg tattattctg cagcatgacc tgtgaccagc 2100 
actgtctcag tttcactttc acatagatgt ccctttcttg gccagttatc ccttcctttt 2160 
agcctagttc atccaatcct cactgggtgg ggtgaggacc actcctgtac actgaatatt 2220 
tatatttcac tatttttatt tatatttttg taattttaaa taaaagtgat caataaaatg 2280 
tgatttttct gatg 2294 

<210> 124 
<211> 956 
<212> DNA 
<213> Homo sapiens 

<400> 124 

gatgagttcc gcaccaagtt tgagacagac caggccctgc gcctgagtgt ggaggccgac 60 

atcaatggcc tgcgcagggt gctggatgag ctgaccctgg ccagagccga cctggagatg 120 

cagattgaga acctcaagga ggagctggcc tacctgaaga agaaccacga ggaggagatg 180 

aacgccctgc gaggccaggt gggtggtgag atcaatgtgg agatggacgc tgccccaggc 240 

gtggacctga gccgcatcct caacgagatg cgtgaccagt atgagaagat ggcagagaag 300 

aaccgcaagg atgccgagga ttggttcttc agcaagacag aggaactgaa ccgcgaggtg 360 

gccaccaaca gtgagctggt gcagagtggc aagagtgaga tctcggagct ccggcgcacc 420 

atgcaggcct tggagataga gctgcagtcc cagctcagca tgaaagcatc cctggagggc 480 

aacctggcgg agacagagaa ccgctactgc gtgcagctgt cccagatcca ggggctgatt 540 

ggcagcgtgg aggagcagct ggcccagctt cgctgcgaga tggagcagca gaaccaggaa 600 

tacaaaatcc tgctggatgt gaagacgcgg ctggagcagg agattgccac ctaccgccgc 660 

ctgctggagg gagaggatgc ccacctgact cagtacaaga aagaaccggt gaccacccgt 720 

caggtgcgta ccattgtgga agaggtccag gatggcaagg tcatctcctc ccgcgagcag 780 

gtccaccaga ccacccgctg aggactcagc taccccggcc ggccacccag gaggcaggga 840 

cgcagccgcc ccatctgccc cacagtctcc ggcctctcca gcctcagccc cctgcttcag 900 

tcccttcccc atgcttcctt gcctgatgac aataaaagct tgttgactca gctatg 956 



<210> 125 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 16 

<223> n = A,T,C or G 
<400> 125 

aaattatata tagtgnttca gctcccattg 
acttaagtat tcaattcact cttggcattt 
ttggaaaact gcttttcttc tgagaacctt 
tctaagtcca gagctaactt agtactgttt 
tctgtttagc cagtgttacc aaggtaagct 
gcattttagg acattatggc agctttagaa 
agcgcaggtt ttggatacta gagaaagtca 
tctgatgtga attcaaattt tacctctgtt 
tttact 



tggtgttcat agtcttctag gaacagataa 60 
tttctttaat ataggctttt tagcctattt 120 
attctgaatg tcatcaactt taccaaacct 180 
aagttactat. tgactgaatt ttcttcattt 240 
ggggaatgaa gtataccaac ttctttcaga 300 
ggctgtcttg tttctagcca agggagagcc 360 
tttgcttgta ctattgccat tttagaaagc 420 
acttaaagcc aacaatttta aggcagtagt 480 

486 
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<210> 126 

<211> 3552 

<212> DNA 

<213> Homo sapiens 

<400> 126 



gctgctgacc ctcgtgatc? tLgtc^gl tSafo^^ Sgagccgtct gcctgcat?? 120 
clTc^T"'' ^^^^-^-^g cagLaSa? alSggS ^gatacttaa 180 

caggtctgca gacctcatcc ggtcaagtga tcctllttft ^^^^^^^^^9 aagagtgctt 240 
agtgtacaca gccagggctg ttgcgcSJc taSfa^^!! ^^agttctaa atgatgggtc 300 
ttctgacaaa aggaaacaga caLgaaJga g??^aSata c?a ^'"^ ocatatggct 360 
ggtatcgaag acaagacaca ctagaaaaac t^^^^r^^ ctgctagaac atcagaagaa 420 
acctattcct tgctctatgc aaSga'tc cSgggcS t?c"%\'^^ ^^agatg^gc 480 
tf^Jf «^*9*«g°a° agaactatac SJ???ctac t^f^ f ^tcttcaaca 540 
tgataaagaa cotttaaatt tgttttatat anlfr ^caataagtg gacgtggagt 600 
tcggcctgtg gatcgtgaag aata^gSjt tlttatttt" ^^^^^aaatc tat?t?gcL 660 
agatggatat tcagcagatc tgcccc^cJc acllla^l^ attgcttatg cgtcaactgc 720 
caaccaccct gttttcacag aagcaattta telttl^ll a?fS'^"^^ atgaaaatga 780 
^^^^^f gtgggggtgg tttgtgccac agacaSaaJt atl^^^^^ gtagtagacc 840 
gcgcctgaaa tacagcattt tgcag^gac alclllnf^ gaaccggaca caatgcatac 900 
tcccagcaca ggcgtaatca ccacS??c SSt^^ cctgggctct tttctgtgca 960 
gtactcattg ataatgaaag tacaaScat ItttJ. ^ gacagagagg ttgtagacaa 1020 
atcaacttgt atcataaca? ta^^c "^"^^^at tgataggcac 1080 

tgcttatgaa gcatttgtag aggaaLtgc JJtcaa^a?^ ^^^^P^^" tcagacaaaa 1140 
agaagataag gatttaatta acactcjccaa Ttlnl^l^ ffaaatcttac gaatacctat 1200 
aaatgaaaat ggacatttca JaltcagcJc aaf.^^^^ ° aattttacca ttttaaaggg 1260 
tgttgtaaag ccactgaatt atg^ajaaa^ cc^^f actaatgaag gtgttctSc 1320 
caatgaagcg ccatttgcta gagS?^ tXXfal '^T.'"'-^ ttggagtaaa 1380 
tacagttcat gtgagggatc tg|atgaggg g^^tSatac f^^^^^^^ gagccttggt 1440 
gcggattaaa gaaaacttag cagtggggS aaaa!^™ actcctgcag ccoaatatgt 1500 
cgaaaataga aatggcaatg gtSSggL caafaaa??o ITT^"^ <=atatgaccc 1560 
aao^or^^"^ 9aaatttcag ggtcaa?cat aaoticllll atcnf ^^^^ttggat 1620 
at2=o^^^ aatgagttgt ataatattac agtccWa T.IJ^^^^ gggaggttga 1680 
atgtactgga acacttgctg tgaacattga aoataSfff ^^agacaaag atgatagatc 1740 
tcaagaatat gtagtcattt gcaaacc^S JSaSS gataatccac cagaaatact 1800 
tcctgatgaa cctgtccatg gagctcca?? uSSS fr^^""*" tagctgttga 1860 
aatcagtaga ctgtggagcc tcaocaaagt taaSataS *^*9^="=^ata cttctccaga 1920 
gaaaaatgct ggatttcaag aatataccS tlTtltllS gctgcccgtc tttcatatca 1980 

taoif^?^^ aaattattga gagttaatct g?Sgaat2 ttrT^""^ gggccggcca 2040 
tgcgacttca aggagtacao oaot;=,;,i-=.^-K ^'^^^^aatgt actcatccaa otcagtatca 2lon 
gggtatagca ctgcLttS ItgtaTgal aa^ttf ^ f^'"''''^^' caatattac? 2160 
taaagggaaa cgttttcctg aagatttagc aSSSSc tST.^.^'l ttggtgcaac 2220 
agcacctgga gacgatagag tgtqctctac o^f? f • ^^^attatat caaacacaga 2280 
ctctagccaa ggtttttgtg gSStgga at^^^f atgacccaaa ctaccaacaa, 2340 
cattgaaatg atgaaag^ag ^a^S cttgJIaS ^^^^^^^^ ggcaggaaac 2400 
tcataccctg gactcctgca ggggaaalcf f tgccgggggg ctgggcatca 2460 

ctcggagtgg cacagtt^ta cSacScg ^ctlllllll ^"'^fft^-- gatacactta 252? 
gaatgaagac cgcatgccat cccaagatta ^gS?cfct tat. gatgtaatca 2580 

atctccagct ggttctgtgg gctgctacaa tal^r ^a^aactatg agggaagagg 2640 
tttaaataat ttggaacc?^ aatttat^c SaJS S"'^"*^ 90c?tgL?? 27^0 
tcacagtgct acaattaggt ctttqtcaoa allt^^^ gcatgcacaa agagataatg 2760 
aaagttcaat ttcaacatgt atgJSatS tlll^Ttl ^"^^^^^^ ataatattg? 2820 
SX' ~ I ~ ^3.0 

«,cto«„ a«.„,g. 3„„„ 
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aacaatagct aagttatgct aatatcacat 

aaaaataaac aagaaatatt gagtatcact 

gaagactgaa ttaaattaaa aatgttgcag 

actaccaaat tcatttgact ttggaggcaa 

ttttctatag gaatatagtt ggaaataaat 

aatatttaaa tgaaatgaga acaaagagga 

tatagtttgt cctacaatag aaaaaagaga 

gcattataac tgagtctatg aggaaatagt 
ttgtaaataa at 

<210> 127 
<211> 754 
<212> DNA 
<213> Homo sapiens 



tattatgtat tcactttaag tgatagttta 3120 
atgtgaagaa agttttggaa aagaaacaat 3180 
ctcataaaga attggactca cccctactgc 3240 
aatgtgttga agtgccctat gaagtagcaa 3300 
gtgtgtgtgt atattattat taatcaatgc 3360 
aaatggtaaa aacttgaaat gaggctgggg 3420 
gagcttccta ggcctgggct cttaaatgct 3480 
tcctgtccaa tttgtgtaat ttgtttaaaa 3540 

3552 



<400> 127 

tttttttttt ttgtcattgt tcattgattt 

gcctttcaaa ggtcacacag aagtaagtga 

gctctagtgt ccatgcttct caaccattat 

ggacacgtga aatgtatccg gtattttact 

gaagacatac acaaaaataa tggttacaat 

acctatatta aaatgtaagg cttttgatat 

aacgtttgta ggggagcaca ctcctgcatg 

cctttatggt tggatcatct tgtcattaaa 

gaatcaagac tgcaatatcg cctgcttttc 

tggtcctcaa agtaaaaccc ctgtgtcagt 

accaccttct aatactttta atacccaatc 

ctcatctgta aagctgtgct tcaaaatagt 

atgatgtcga acctgcccgg gcggccgctc 

<210> 128 
<211> 374 
<212> DNA 
<213> Homo sapiens 



taatgagaaa gctaagagag gaaataagta 60 
cagatccagg attcatatcc aagcattctg 120 
gacccaatat tcaaccaaat caatactgaa 180 
attacaaaca aaaatccaat gaacattctt 240 
agaagttact ggaattgaaa ttttggttca 300 
agctaataga tttttgaaat gatcagtctt 360 
gggaaaagat tcactgtgaa gcacagagca 420 
gttcaggcgt tatctatcct gtaagtggca 480 
tttttaactc atgttttccc ttgactacac 540 
gtactattca tggaatactc tgcaattata 600 
aaaatttatt atacatatgt atcatagata 660 
gatctcttcc caacattaca atatatatta 720 
gaag 754 



<400> 128 

aggttttgat taaaaaggca aatgatttta 
gaaggagtaa aattaaagat gaaagatgat 
ttcccctgcc cttggtaagt aactcttgat 
aaaaaacaga gccagctaat catttccaaa 
ggtttaattg aataaaacta tatgttcata 
tttcttcctt agttaaggca ttataagggc 
aacttaaaaa gctg 

<210> 129 

<211> 546 
<212> DNA 

<213> Homo sapiens 



ttgttcgata atcttttaaa aaaataagag 60 
ttttatttcc ttgtgacctc tatatccccc 120 
ggagaaagga ttaaagactc ttatttaacc 180 
ggttagtatc tccctgctga cctcttcttt 240 
tatgtattaa aacaactcag aataacatct 300 
tatactatca tccataataa ccaaggcaat 360 

374 



<400> 129 

agtgtgatgg atatctgcag aattcgggct 

tcccagcacy tgaaaaggag cctcctgagc 

cctcatttct gcctactgat ttccttggag 

aacctggtac atacatagca tgactccctg 

gagagtgatt gacatgcact ttcaagctat 

acctcgagta aattccatca ttttttataa 

tcagcgtaac aggatctcca gtctctggct 

tgggataaaa tccctgtttc acattggcat 



aagcgtggtc gcggcccgag gtctggaact 60 

tgactcggct aaagccccac tttcgctcct 120 

cattcatctg aatattaccg tttgctgtgt 180 

gaatagagtg ggctggggtg cttatgctgg 240 

atctaccatt tgcagcaaag gagaaaaaat 300 

catcagcacc tgctccatca tcaaggagtc 360 

caactgtggc agtgacagtg gcattaagaa 420 

aaatcatcac aggatgagga aaatggaggc 480 
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tgtctctttc cacaaaggct tccacagtgg ctgggggcac agacctgccc gggcggccgc 540 • 

546 

<210> 130 
<211> 5156 
<212> DNA 

<213> Homo sapiens 
<400> 130 

l?-tll- ~lt -~ a^aga cggeccggca .0 

alSa- 2~ S« - 

acactgatgg cattcactgc g^gaa^Sa lllt^lttf tgcaatgaca 240 

accgctgttt gccctgcaat tgLaS^S aS?I^^J ttaccggcac agagaaaggg 300 
ccggacggtg cagctgtaaa clagSS Sa^n^f ^f^tgctcga tgtgacaact 360 
gcttccacat gctcacggat gcggjjg^a ccSaaf^.^ atgcgaccga tgtctgccag 420 
gtgactgtga cccagctggc atcgcllTol Ic^tn ^^^^^t^cta gactccaagt 480 
cagctgtcac tggagaacgc tgtgaSgS SaSaan .^^^^^^ctgt gtctgcaagc 540 
ggaaccctga gggctgtaco cag?gtt??J qcStaa^o! ctggatgggg 600 

ctgcagaata cagtgtccat aagatcacrt ?ScS?f^^ ttcagccagc tgccgcagot 660 
aggctgte,a acgaaatggg tcLtgcaa agcSSta ^tcaL''^" ^atggctgga 720 
tgtttagctc agcccaacga ctagacccta tof^^fff 5 gtcacagcgc catcaagatg 780 
ggaatcaaca ggtgagctat ggJSafgc? t^cc^t"S llT'^Vr ^^^^^ttg 840 
gcagacaccc atctgcccat gatgtgattc SaaaaaJa. ^^^^=^9^9 gacagaggag 900 
ccttgatgcc acttggcaag Lactgcct? ItlaaJr-t ^^^^^^^^^g atcacagctc 960 
taaatgagca tccaagcaa? aattgga jc IcllaTtaaa tt^^.%^''^° acattcaggt 1020 
tactgcggaa tctcacagcc ctccgca^cc gSacatf tatcgaaggt 1080 

acattgacaa tgtgaccctg atttcagccc gcSot^tn J^^^?^^*^^ agtactgggt 1140 
ttgaacagtg tatatgtcct gttgggfaca agaaacaSt '^^^^'^'^^^^ gcaccctggg 1200 
gctacaagag agattcagcg agaciggggc cStta^S^ ctgccaggat tgtgcttctg 1260 
aagggggagg ggcctgtgat ccaqaScao a»« ^f^^f ctgtattcct tgtaactgtc 1320 
acattgagtg tgctgLtgc ccSttggt? t'Saacaa' f^^^^^^^^ gagaatcctg 1380 
gcaagccatg tcoctgtcat aacggg??ca Ztaata.r.t ^^^^^^"^^ac ccccgcagct 1440 
• tggtgtgcaa taactgccct cccggggJca cTaatToT ^fgccggag acggaggagg 1500 
gctactttgg ggaccccttt ggtgaacata aatrllt^ ctgtgagctc tgtgctgatg 1560 
gcaacaacaa tgtggacccc fgtgJctc^g ?aaatSl«^ gccttgtcag ccctgtcaat 1620 
tgaagtgtat ccacaacaca gccggcaS aclToltltt "^^^^ctgaca ggcaggtgtt 1680 
gggacccatt ggctcccaac ccagLgaca aSatca!^ gtgcaaagca ggctacttcg 1740 
gotcagagcc tgtaggatgt cgafgtgatg gSStotat "^"^^^^t aaccccatgg 1800 
gccccaactg tgagcatgga gcaticlgc? JScfaSf^ "gcaagcca ggatttggtg I860 
agatggatca gtttatgcag cagctS^ oaTalTnl^ ctataatcaa gtgaagattc 1920 
gtggtgatgg agtagtacct gaLcaSgc tggaSf ^^tgatttca aaggctcagg 1980 
cccttcagga cattctgaga oataccr;,L ^ gatgcagcag gctgagcagg 2040 

tccagttggc caaggt^agg a'gccaaS toaTctlaal '^^^^^"^^ tccct?gg?? 2100 
agatgactgt ggaaagagtt cgggctctaa tl^ltllt ^^^^^^^t? gatgacctca 2160 
ctcacaggct catcactcag a?g?agS^f TclllaTat ^^T^^'^^a gttcgggata 2220 
aoaal"^'"' tcctgcctca gaLacta^g tgggSaa Tatttt^''^ ^ccttgggaa 2280 
aggaggccac aagattagca gaaagccacg tt?a?t™ ^f^f ^'^^^^ agtctggctc 2340 
caagggaaac tgaggactat tccaaacaag ccctcS^^ cagtaacatg gagcaactga 2400 
aaggagtcgg aagcggaagc ggtagcccaa Tra^^^ f ^ ggtgcgcaag gocctgcatg 2460 
aattggagaa aaccaagtL ItgllToTgl aSScaaa ^^'^^^^^^g cttgtggaal 2520 • 
ttgaagcaga taggtcttat ca|cacag?c Soccton? ^^^^gccact caagcggaaa 2580 
agggagtcag tgatcagtcc tttcaaataa laJ.t 99attcagtg tctcggcttc 2640 

attcactctc aagoctggta accaggLS tgStaaatt caaaaagcgg 2700 

tgggaaactg gaaagaagaa gcacagcSgc ^c?Sca^" caagcgtaca cagaagaatc 2760 
aatcagatca gctgctttcc cgtgcJaatc ^Jactaaaaf gggagagaga 2820 

gtatgggcaa tgccactttt tatJaaSJg aSooat!^? cagagcacaa gaagcactga 2880 

gaagrtg agagcatcct taaaaacctc agagagtttg 2940 
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acctgcaggt ggacaacaga aaagcagaag 

tcagccagaa ggtttcagat g.ccagtgaca 
gcgctgctgc tgatgcacag agggcaaaga 
gtgagattga acaggagatt gggagtctga 
ccttggccat ggaaaaggga ctggcctctc 
agctggaaag gaaggagctg gagtttgaca 
cagaagccca gaaggttgat accagagcca 
tcaacacatt agacggcctc ctgcatctga 
ggctggtctt actggagcag aagctttccc 
ggcccatgat gtcagagctg gaagagaggg 
tggagacaag catagatggg attctggctg 
acctgccccc aggctgctac aatacccagg 
atttctcaac tgaggttctt gggatacaga 
tgggtgggat ggggacattt gaacatgttt 
cccattcctg atcccatggc caggtggttg 
tgctgggcaa tgaggcagat agcactgggt 
gaatagactg gatggaaaga caaactgcac 
gtggagtcct ggaatttgga caagtgctgt 
tgtgactaaa ggaaaaaact ttgactttgc 
cagagtgcaa cccagtcaca ctgtggccag 
caagcttctt gctgatcaga gttcctccta 
attttcaagc tggaagaagt gagcagtgtt 
agagctatgg tgcttgctgg tgcctgccac 
tttcttttaa tgatgccatg gcaacttaga 
agcaaagcaa atgttgggaa agtatttact 
gcttgggcat tgaaagaggt aaaattctct 
ttagaacacc aaaaatgatg cgcatcaatg 
tctttcctcc acccataata agagaatgtt 
tccctccatt catccttcca tccatctttc 
tatatttatt gagtacctac tgtgtgccag 
tgccctcata gagttgattg tctagtgagg 
acttacaaac tttgtttgtc acaagtggtg 
tctttgctca acagaacata tgttgcaaga 
aggctgacag agctctgggt tgtgcacatt 
ttctacaact gattgcaaca gactgttgag 
gaaccagagg cacttccacc ttggctggga 
ccttggattt tcctgaaagt gtttttaaat 



ctgaagaagc catgaagaga ctctcctaca 3000 

agacccagca agcagaaaga gccctgggga 3060 
atggggccgg ggaggccctg gaaatctcca 3120 
acttggaagc caatgtgaca gcagatggag 3180 
tgaagagtga gatgagggaa gtggaaggag 3240 
cgaatatgga tgcagtacag atggtgatta 3300 
agaacgctgg ggttacaatc caagacacac 3360 
tggaccagcc tctcagtgta gatgaagagg 3420 
gagccaagac ccagatcaac agccaactgc 3480 
cacgtcagca gaggggccac ctccatttgc 3540 
atgtgaagaa cttggagaac attagggaca 3600 
ctcttgagca acagtgaagc tgccataaat 3660 
tctcagggct cgggagccat gtcatgtgag 3720 
aatgggtatg ctcaggtcaa ctgacctgac 3780 
tcttattgca ccatactcct tgcttcctga 3840 
gtgagaatga tcaaggatct ggaccccaaa 3900 
aggcagatgt ttgcctcata atagtcgtaa 3960 
tgggatatag tcaacttatt ctttgagtaa 4020 
ccaggcatga aattcttcct aatgtcagaa 4080 
taaaatacta ttgcctcata ttgtcctctg 4140 
cttacaaccc agggtgtgaa catgttctcc 4200 
ggagtgagga cctgtaaggc aggcccattc 4260 
ettcaagttc tggacctggg catgacatcc 4320 
gattgcattt ttattaaagc atttcctacc 4380 
ttttcggttt caaagtgata gaaaagtgtg 4440 
agatttatta gtcctaattc aatcctactt 4500 
tattttatct tattttctca atctcctctc 4560 
cctactcaca cttcagctgg gtcacatcca 4620 
catccattac ctccatccat ccttccaaca 4680 
gggctggtgg gacagtggtg acatagtctc 4740 
aagacaagca tttttaaaaa ataaatttaa 4800 
tttattgcaa taaccgcttg gtttgcaacc 4860 
ccctcccatg ggggcacttg agttttggca 4 920 
tctttgcatt ccagctgtca ctctgtgcct 4980 
ttatgataac accagtggga attgctggag 5040 
agactatggt gctgccttgc ttctgtattt 5100 
aaagaacaat tgttagaaaa aaaaaa 5156 



<210> 131 
<211> 671 
<212> DNA 
<213> Homo sapiens 

<400> 131 

aggtctggag ggcccacagc cggatgtggg 
ttttgcatcc cggttgcagt gtgttgcaga 
cctgggcagc caycacgagg atcatgactc 
tcccgatgct ggtggagtgt ttgttgacac 
ccattgcgct ggtttatccc tgagtcctgt 
agaatgaggg caagatccct ctgcgagggt 
tgcctagaag ccaatgggtg cacagtg^tg 
aggatgtcgc ctggaatatt caaattgaat 
gaatttttct ttcgccatac agaaattgtt 
cctgaccctt cctgctcccc aggaagggag 
cgtgacaccg gagacaggtc ttcttcaccg 
ttaactgcta t 



acaccgggaa aaagtggtca tagcacacat 60 
cgaagtcctc ttgctcgtca ccccacactt 120 
ggaaaataaa gatgactgtg atccacacct 180 
ccccgatgaa agtgtgcagc gtcccccaat 240 
ttccaacgac tgccagtgtt tcagacccaa 300 
ttcagacctc cttctcctac cccactggag 360 
atacgaatgt caatctttgc tcggtcagtg 420 
tacagatgca tgaagagggc gtacaagtta 480 
tagccagatc ttctgtactt cttttccttc 540 
gtcagccccg tttgcaaaac acaggatgcc 600 
acaggaagtg ccttctggtg cctgcacgtt 660 

671 



wo 02/00174 

PCT/USOl/21065 

62 



<210> 132 
<211> 590 
<212> DMA 

<213> koitto sapiens 
<400> 132 



SSSSt. Tcmillll titllttT ^^r-- t so 

?9ttc^tctg cagccccaaa t^tggag^a SSacS^ ^aattcactc cattgccctg 120 
tttgttccag atatatcaaa ctccaataac ^I^!^ ttacaggagg tttaaagttc 180 
ggaactggag acattttcca gcaacS ctttcagtag aatttcctct 240 

cctcaccatc aattgaaaaa ScSga^ SSaSaS 'IT''^^' aaatgtcaaa 300 
tttctagtta cgtggcaggc cagtgg?cct S?aS«fj! ^tgtgggcaa cgacactatg 360 
cgaaaatact acacaaataa tt?taSS aaSjt? '^"^^^^^ tcctgatggi 420 
attccaggaa cagctaagcc tgggcactgg actJacaS^ ""^^gacago tagtctttgg 480 
ctgcaagccc tgaaagtgac agt^cctS ZToTci: aSga^^? ^ 



<210> 133 
<211> 581 
<212> DNA 
<213> Homo sapiens 

<400> 133 



aitt ?SSa1 llttTclol tlltlir^ "--^^^ ttggggagag 60 
ctgtcctcat ctctgcaaag ttcagcScc Jtco^«''°! ccctgtaccc tagcctgcac 120 
atgctctggg gagctcatgg gtgjaqaaat It^T^^^ ctctgtgcac tctgtcttgg 180 
tgggccaggg atgaatat?? SjJgS aat?S ^'^''^^^^^^^ ggggactg^^ 240 
gggggagaac agagaggagc ?gggctatgg gaaaSStt f ^^^^^-^^ attggtagta 300 
tggctggata tctggtacta aaaaaggg?? ???JaSJ" f ^^aatgga gctgggaata 360 
aatccaaacc atagctgtct gtccartoct rf ^ f '^^^"cctaa tctcttcccc 420 
cctcctagac tctgtccctg ggS^ca a^2f ^^"^ ctccagctct gccccaggct 480 
ggctgaggag agtgtgaca? SSggagag SSS? T'^^^^^^ "gggg^ga 54S 

<210> 134 ^^"^ 

<211> 4797 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

tllll 

j> n = A, T, C or G 

<40Q> 134 ^ 

ot?s s~ SIS iSccS^j » 
ssis ™? ™s KSS? I?— - 

tctccggctc aggtgcagg^ ola^tTrt f ^^^^^^^^^ tggggatgac tctgalaart 240 
ggtcatgcct ggg'gg'caK Sca^gSgt cS.'tt ta'cT^'^^" ^ac^gcaaS 3^0 
caccctgcct gccctgtctc caoccaacta ^^r^!^^ tactgagcat gtaotgagtg 360 
ggggtcgtga gctgc^gtta aggag^gctc aScttaafa ggagaggS? 420 

tccagaaggc cccagggcgc nSgctgcac gcSrtcat aJff .'^^^ ggctgtgagc 480 
ctcactaaga aacctctgga acccccttca atl^ltltl f*<=°actagg aatagcttta 540 
tctcatctgc aaaatgggL taatSttg fcSgaSa^ ^iT"^^' gcctctattt 60S 
cacagagcca gctggggtgt agctcttcca^ c\"SS? tt^^SS 7IS 
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tgtggggact 
aggtggctct 
tcatccctgg 
tggaagtgtc 
tgtgtgccct 
cacccctgtg 
gagacaggct 
gagcttagcc 
gcccaggtgc 
accccccacc 
cttgtcacag 
gtctccagaa 
agaggggccc 
ccgggagcag 
ggcctcaacg 
catgcagcct 
cactccccac 
ctccagtcag 
ttgggaaatt 
ctgcgcgatc 
ccagggcctc 
atcacccaag 
aacgttatac 
get at teat a. 
ceagggtccg 
ttgggtgcat 
gtagtggagc 
cccacaggac 
tgaccgccgg 
ggacaggaaa 
getgctgtgg 
cacgagagcc 
tgccggaggc 
catgaagaag 
ggcctaccag 
ccctccgccc 
tggcctcccc 
cacggagtcg 
tagggcacca 
ccattctgac 
gagggggatc 
ggggcttggg 
ggccgctgag 
tcgacttgtt 
aettetgagg 
tggtcgggag 
ccaaatctgg 
atgtgtgcaa 
ggctggccag 
catgcgctca 
aagtgaatgc 
tggagatcct 
cctcctggaa 
ggccaggttc 
attttccaaa 
ttccctgtac 
ccactgcctt 
accaacatgt 



gggggagaga 
tgagtggacc 
ggaacftgacc 
tgttgttgga 
gggataagta 
gtcacagaaa 
gcacctgaca 
tcagctgcct 
tctggagcct 
tccccaacac 
eagaccccct 
cccaccggcc 
aaggagggag 
gaggccaccc 
accacagcca 
ggc caeca tg 
acagaggatg 
ctcccagcag 
gagtgggttg 
tcgtattcct 
gcagagcagg 
agagggctcc 
cagtcatttt 
eaaaatgtgt 
gagttgatgt 
ctgagtgggt 
tggttgctgc 
ttcacctttg 
aaecagtccc 
gaggtgetgg 
ggtcagggtg 
caaggagccg 
ctcgtggggc 
aaggacgaag 
aagcccacca 
tgecactcac 
tgeeaeeagg 
tgggtgtgct 
ggggtttctc 
tcggtttctc 
cgactgcttt 
gctca cacao 
tggcagggga 
tttgcacatg 
taagttaagt 
acagcatcag 
aagccaaaat 
cagggtatgg 
tccaggctgc 
gggccatgct 
tgggactcag 
ectgeaggct 
attgacgagg 
tccgttagct 
gagtgatagt 
agtagatttt 
cgcttcttgc 
tgggaaactt 



agtccctgag 
tcaggaagag 
tagcggaggc 
agtgggggcc 
ggataaccac 
agctttccca 
cacacaatgg 
tgtccaggta 
cccccgacce 
actctgcttc 
ccacttggaa 
tggaggctac 
aggctgtagt 
cccgacccag 
ccacggccca 
agacctcaac 
gaggtcettc 
cagagggctc 
gtcctaatgc 
caccaggaag 
acagactaac 
caaactcaca 
atttatagct 
gctttgtatc 
ggcaagaagg 
ggtggcaaag 
tgctggcggt 
aaacctcggg 
cagtggatca 
gaggtgagtt 
gggctgacca 
ctgagctgag 
tcatctttgc 
gcagctactc 
aacaggagga 
taggccccca 
ccacctcccc 
gggagctcca 
gcataggacc 
caaactgaag 
ggacctaaat 
ctgtagcact 
caggagtcac 
tttcctctag 
aagttgattc 
ggttaagaag 
gtaggcttag 
actatctgtc 
cgtggggccg 
gaggcctggg 
cettcagaca 
cacgcccgte 
ggtgtcttgg 
cctgtggccc 
cttttgcttt 
ccaaatgtaa 
ctctgtgctg 
ggctcgaatc 



ctggaggtgg 
gggtgagaga 
ctgagagctg 
tttttttcag 
agtagttatg 
ggtggcctag 
aaggacagct 
etagcctccc 
acccaacaca 
tggtcctgea 
ggacacgcag 
agctgcctcc 
cctgccagaa 
ggagaccaea 
ggagcccgcc 
ccctgcagga 
tgccaccgag 
tggggagcag 
ctggcacttg 
acagggcaca 
tatgagatca 
atccaaactt 
tcgtggattt 
aetttttgtg 
cctggctttc 
atcagggagg 
gacctggcca 
ggagaatacg 
gggggccacg 
ttcttteagg 
cagccaaggc 
ctggccccgt 
tgtgtgcctg 
cttggaggag 
attctatgcc 
cttgcetctt 
agcattccag 
ctctgcttct 
tttccaccac 
cagcctctcc 
ggcctcatgt 
tactggtagg 
tttgtttcgt 
ttctttgttc 
ggtatccccc 
actttttttt 
tttgtgtgtt 
tggtggcccc 
ccgcctcttt 
ccgetgecac 
gagaggactg 
ctcctgtggc 
gcagagctgg 
caccctgggc 
tggcaaaact 
taaactttaa 
tgtgtgacgt 
tctgtgcctt 



tcagggaagc 
gctaaggaag 
caaggtagga 
ggagggtggg 
cccctaaggg 
gcacctgtct 
ctccttgtcc 
tcatagcctg 
ctctgcttct 
ggtgctttgc 
etcctgacgg 
acctccaccc 
gtggagcctg 
cagctccega 
acctcccacc 
cecagceaag 
agggctgctg 
gtgagtggcc 
gcaggcccta 
ggggccgcct 
gagcagaagc 
gcagccctcg 
acgcttacac 
atatecatgc 
gggccctgtg 
caggagctgc 
acccaatctg 
gctgtagtgg 
ggggcetcac 
ggggtagttt 
cactgctttg 
ctacctgccc 
gtgggtttca 
ccgaaacaag 
tgacgcggga 
ccttgaagaa 
cccctctggt 
ctgacttctg 
agccagcacc 
ccaggtccag 
ggctggaaga 
accaagcatc 
ggggaggtet 
atagcccagt 
catcttgctt 
ttttttttaa 
gtctcttgag 
gttctggtgg 
caagcagtcg 
gttggagaag 
tagggagggc 
gccgtctcca 
ctctgagcgc 
cctgggctgg 
ctacttaatc 
tataaagtag 
gaccggactt 
cgtctttcce 



ttcacagagg 
gaggctgagg 
tatctgttgt 
gccagagaag 
atgcccaccc 
cgtggctcca 
attttccaag 
agcttggcea 
ggtcctcccc 
aagatatcae 
ctattcccac 
tgccggctgg 
gcctcaecgc 
ceactcatea 
cccacaggga 
ctgaccttca 
aggatggagc 
tctgcattcc 
cacctgtgcc 
tcccctaccc 
acccttaaag 
tcgaagagtg 
taaatagtct 
catggtccag 
cgatcctggt 
ttctgggtct 
cccctgccct 
ccgtggagcc 
agggccteet 
ggggtgaatt 
ggagggtctg 
taggggtcat 
tgctgtaccg 
ccaacggcgg 
gccatgcgcc 
ctgcaggccc 
cgctcctgcc 
cctggagact 
tggcatcgca 
ctctggaggg 
tcctgcgggt 
ttgggggggt 
aatctagata 
agaccttgtt 
ccctaatcta 
actaggagaa 
tttgtcgctc 
tctgttggca 
tgcctgtgtc 
cccgtgtgag 
ggcaggggcc 
ggggctgctt 
ctccatccaa 
aatcaggaat 
caatgggttt 
tctgtgaatg 
ttctgcaaac 
atggggaggg 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 
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sitiToZi Tsniiii ,^srTf "j^^^^-^ ^2^0 

ccaagcccat cttctLatg gcaaa^gala ^JalttJ ^^^ttctgtg ttgccaagct 4320 
gctgtggtca cagcggagg? gtgg^^Sga S^SaScc ^^^'^^'^^^'^ ttgacctgag 4380 
cctcccaccc ggctgcagag gc??gS agcccaaart cc^^^'^'^' tggacatcct 4440 
tgacaacgtt tcagcgactc cgttggccac ??cS^^ a^^^''!^"^ cttgcttatt 4500 
atgcaccacc aagccaaggg aaccJKSc S^tSf ^f^.^^^^^^g tggatcagag 4560 
agtgtatgac tgcacatgac tcaaaaataa ^^f^ tactgcgact ttctgcctgg 4620 

ctggtccgtg ggacggtn^cc S^f tgctLtcS 4680 

^acttgtca tttcgggcaa cggc%c?g? T^lf^l^ll l^f^l^l -ataaacg 4740 



<210> 135 
<211> 2856 
<212> DNA 
<213> Homo sapiens 

<400> 135 



SgS^ SgS:? ^^^-^^^-^^^^^ accaaacgac gggggtcgga 60 

cgcacgcccg tcgccacccg cgtacccgg? gcagSaS tt^^'^^^^ gagcgccgct 120 
tggagcccag cagcaagaag ctgacggSc gcc?cSac? a^^^^^^^'' agcgctgcca 180 
ttggctccct gcagtttggc tacaacaSg llllottlT. f^^^gtggga ggagcagtgc 240 
aggagttcta caaccagaca tgggtcoacc tgccccccag aaggtgatcg 300 

tcaccacgct ctggtc^ctc S^gjca f ^^^-^5 cccaccacgl 360 

tctctgtggg cottttcgtt aaccacttta ^^n^ ^ tgggggcatg attggctcct 420 
tgctggcctt cgtgtcc^cc gtgScatg^ SSSa' "^^'^^'^ atgatgaacc 480 
tgctgatcct gggccgcttc atcatoggig ?qtaS™ ^^t^gg^aag tcctttgaga 540 
ccatgtatgt gggtgaagtg tcacccacag c?^t?cSaa allJ-T^'^ ggcttcgtgc 600 
agctgggcat cgtcgtcggc atcctcatcj ccca^aS? t^^ l^^^'' accctgcacc 660 
gcaacaagga cctgtggccc ctgctgctgl gSSSt ^ff^^^^gac tccatcatgg 720 
gcatcgtgct gcccttctgc cccgagag?J ScScS f ^^f^gcc ctgctgcagt 780 
agaaccgggc caagagtgtg ctaLgaJc tgcgSaaac tJ-t^^^'T '^STcaacgagg 840 
tgcaggagat gaaggaagag agtcgaoaaa Intf^^^ agctgacgtg acccatgacc 900 
agctgttocg ctcccccgcc taccgSc SjctSt ITJTT ^'^-^-^tgg 960 
cccagcagct gtctggcatc aacgctgtct fctatt^cf^ ^^'^'^^'^^'tg ctgcagctgt 1020 
cgggggtgca gcagcctgtg tatgccacca ttgg^tcSa tS'^'" ^^cgagaagg 1080 
ctgtcgtgtc gctgtttgtg gtaqaacaao rlr,^?.l^^ tatcgtcaac acggcc±tca 1140 
tcgctggcat ggcgggttV Scatactc! tScaS f^f'^^^"- ctcataggcc 1200 
taccctggat gtcctatctg agcatcgtgg ccatSSa ^Sf'^''^^ ctggagcagc 1260 
tgggtcctgg ccccatccca tggttcScg tggcJ^aS? ^J^^^^gcc ttctttgaag 1320 
cagctgccat tgccgttgca gglttctcca Sggfcctc aa^t.T"^^? ggtccacgtc 1380 
gcttcoagta tgtggagcaa ctgtgtggtc cctacatS^ ^ff gtgggcatgt 1440 

tggttctgtt cttcatcttc acctacttc! aSttn^^ <=atcatcttc actgtgctcc 1500 
atgagatcgc ttccggcttc cggcjggg^ gagccaacca tTl^^t^^"" ^ggaccttcg 1560 
agctgttcca tcccctgggg gc?gat?ccc aaata^a^af aagtgataag acacccgagg 1620 
gcctgctccc agcagcccfa fggatctctc aa?aacao! caccagcc^ 1680 

aacctgacag atgtcagccg agccggjcct SSt? ^ctlf^^"' gagacttcca 1740 
ccagaagaat attcaggact taacg?ctcc llllttltlt ^^^^^^^^ gcaatgatgt 1800 
aaatctattc agacaagcaa caggmtat al?t«^Jf^ caaaagcaag actgttgctc 1860 
atatcagcct gagtctcctg tgcccacatc ^tactgattt tgttattttt 1920 

gagggtggag actaagccc? g^SSa^ ttSttctt llf'^^'^ "^-tgcct 1980 
ctggacctat gtcctaagga cacao^a=f^ "gccttctt cacccagcta atctgtaggg 2040 
gaggtggcta tggccacccg Sctactaao "^^^^^^^ ctacaaagct tctatcccag 2100 
cattaggatt tgLccttcc catct'St'cS ScSaa'cca' cf^'^'f^^ ggtcaggctc '^"o 
cctgagacca gttgggagca ctaa^^o^= tacccaacca ctcaaattaa tctttcttta 2220 
gocgggttct .gtltlattt gclctS^gg cT.tlTtt fcTr"^'^ agtctgggct 228? 
gggagcctgc aaactcactg ctcaaoaa«a ^ff''^^*^*^ accatgagaa gagggcctgt 2340 
tgcaagatat ttatatata? trngSSt cLtatSS ^'o'^""''^' tgtgtataga 2400 

agttgr caatattaaa tacagacact aagttatagt 24 60 
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atatctggac aagccaactt gtaaatacac 
tataaatggc tggtttttag aaacatggtt 
tttggatggg agtgagacag aagtaagtgg 
gactcaggat ccagtccctt acacgtacct 
tttgatccct gttacccaga gaatatatac 
atcacatatt tgatagttgg tgttcaaaaa 
aggcttgaaa tcgcattatt ttgaatgtga 

<210> 136 
<211> 356 
<212> DNA 

<213> Homo sapiens 



cacctcactc ctgttactta cctaaacaga 2520 
ttgaaatgct tgtggattga gggtaggagg 2580 
ggttgcaacc actgcaacgg cttagacttc 2640 
ctcatcagtg tcctcttgct caaaaatctg 2700 
attctttatc ttgacattca aggcatttct 2760 
aacactagtt ttgtgccagc cgtgatgctc 2820 
agggaa 2856 



<400> 136 

ggtggagcca aatgaagaaa atgaagatga aagagacaga cacctcagtt tttctggatc 60 

aggcattgat gatgatgaag attttatctc cagcaccatt tcaaccacac cacgggcttt 120 

tgaccacaca aaacagaacc aggactggac tcagtggaac ccaagccatt caaatccgga 180 

agtgctactt cagacaacca caaggatgac tgatgtagac agaaatggca ccactgctta 240 

tgaaggaaac tggaacccag aagcacaccc tcccctcatt caccatgagc atcatgagga 300 

agaagagacc ccacattcta caagcacaat ccaggcaact cctagtagta caacgg 356 

<210> 137 

<211> 356 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 254, 264, 279, 281, 290, 328, 342 
<223> n = A,T, C or G 



<400> 137 

gcaggtggag aagacatttt attgttcctg 
gtcactggct gcccccggaa cagggcgctg 
ctatgtctcc cagcaaggac agaaactcag 
cctttttctc aaagacatcg gcgaggtaat 

cgctaaggcc aaanttccag acanayggcc 
ggacccaaac tctggcgcgg aaacacangg 

<210> 138 
<211> 353 
<212> DNA 

<213> Homo sapiens 



gggtctctgg aggcccattg gtggggctgg 60 
ctccatggct ctgcttgtgg tagtctgtgg 120 
aaaaatcaat cttcttatcc tcattcttgt 180 
ttgtgccctt tttacctcgg cccgcgacca 240 
gggccggtnc nataggggan cccaacttgg 300 
gcataagctt gnttcctgtg gggaaa 356 



<400> 138 

aggtccagtc ctccacttgg cctgatgaga 
aatagacact tagatttctc tcttgtggga 
tacattgatg tggaaattgc tgctgctacc 
aatgccagcc atcttggcat cctggccctc 
ctccagccgt gtctttatgt caagcagcat 
gcatcggagc tcactcagac ctcgsccgsg 

<210> 139 
<211> 371 
<212> DNA 

<213> Homo sapiens 



gtggggagtg gcaagggacg tttctcctgc 60 
agaaaccacc tgtccatcca ctgactcttc 120 
accacctcct gaagaggctt ccctgatgcc 180 
gagcaggctg cggtaagtag cgatctcctg 240 
cttgtactcc tggttctgag cctccatctc 300 
mssmcgctam gccgaattcc age 353 



<400> 139 
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IT^t^ t,.a,.,a„ eo 

ss.- 

actatttgac acagtggatc tctatarr!^ agtgatccca aaagatgaaa atggaaaaat 240 
aggattggac ctg^Sgg^ ^gc^Sa aaSt? H''^^-^^^ gtSagatgc aJo 
actagtggat c '''' ^ ""^^ aagccgaatt ccagcacact ggcggccgtt 360 

371 

<210> 140 



<211> 370 
<212> DNA 
<213> Homo sapiens 

<400> 140 



SSg g%'S:?^?? "tg..a ct ^^^^^^^^^^ 

aggagctgcc tgagtggLc ???ctc2cc llTtlTS T''^^^'^ ctacaagaag 120 
aatagaggta tttttaggct atttttgtJJ tSaac^trJ ^^^^'^^^^c ctcatggcag 180 
tgaattccca agccctgcat tgtacagccc cccSc!^ f^^^^^^^c cctgtgtagc 240 
agttaaoact caaaaaaaaa aaaaaac^^^ tcaccaccta ataaaggaat 300 

gcacactggc aaaaaacctg cccgggcggc cgctcgaaag cogaaticca 360 

<210> 141 



<211> 371 
<212> DNA 
<213> Homo sapiens 

<400> 141 



llltTtilTc lllTolllll cS2S ^^^^'^-^-^ ^ ^0 

aaggagcttc agggtcctgg tact^Sa SaS?^ ^-^9fggtca tcactatccc 120 
catcctcagg .gggtacccgc tcttcctcct Itlnlt ^^^^tattca gagtactcat 180 
catggagctg ggagccggca gtgtctgcaj cataa^f^^ gacgcggagc acaggcacag 240 
cgatgaactg gccctggcag gc?cag?gc? Z^cllllt ^^^^^^tcgt gatccagatj 300 
ccgctcgaag c ^'"^ gactcatctc ttggcgacct gcocgggcgg 360 

371 

<210> 142 



<211> 343 
<212> DNA 
<213> Homo sapiens 



<400> 142 

iziiiiii sss imiir' 'T.'^'''^' ^^-----t 60 

agagcagttt tgaaacac?c t{ttg?agaa tttJ-^T^^ agttgaacat tccttttcat 120 
gtcttcattg gaaacgggat accttS?at f^""^ gatgattgga tcgctatgag ISQ 

tctttgggat gtgggS^tc aacccacaga gTaTatlTc aS'T" ^^-^^a^tt" 240 
aacacccttt ttgtagaatc tacaggtgga ItTtlgllt 3"'^''"^" g^agttttga 300 
<210> 143 



<211> 354 
<212> DNA 
<213> Homo sapiens 

<400> 143 



sssss .~t. 

gt,«g,a^ ««=.?i. SSSScc t'SS" 

y a^tcgga tctatgaaaa agtagaataa 180 
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aaattccatc atcactttgg acaggagtta 
agcaaatctc catactgttt ctttcttttt 
cataaacatt ttacatgcag ctatttcaaa 

<210> 144 
<211> 353 
<212> DNA 
<213> Homo sapiens 



attaagagaa tgaccaagct cagttcaatg 240 
tttttcatta ctgtgttcaa ttatctttat 300 
gtgtgttgga ttaattagga teat 354 



<400> 144 

ggtcaaggac ctgggggacc cccaggtcca 
cctagagcac atctggatct cagccccacc 
aagatgacag actaagtagg attctgccat 
gttaagttgc ttaactttca ttctgtctta 
gaaaccatgc cccagagaag gttaagtgac 
aggtttgcct gataccagac ctgtggcccc 

<210> 145 
<211> 371 
<212> DNA 
<213> Homo sapiens 



gcagccacat gattctgcag cagacaggga 60 
cctggcaacc tgcctgccta gagaactccc 120 
ttagaataat tctggtatcc tgggcgttgc 180 
cgatagtctt cagaggtggg aacagatgaa 240 
ttcctcttta tggagccagt gttccaacct 300 
acctcccatg caggtctctg tgg 353 



<400> 145 

caggtctgtc ataaactggt ctggagtttc 
ttcctgagac ttgctggcct ctccgttgag 
attgccactg ttgatcacta gctttttctt 
aatgcaaact gcaagaatca aagccaaggc 
tggaatttgg ggtgtcctta taggaccaga 
atgtgagacc tcggccgcga ccacgctaag 
tagtggatcc g 

<210> 146 
<211> 355 
<212> DNA 
<213> Homo sapiens 



tgacgactcc ttgttcacca aatgcaccat 60 
tccacttggc tttctgtcct ccacagctcc 120 
ctgcccacac cttcttcgac tgttgactgc 180 
caagagggat gccaagatga tcagccattc 240 
ggttgtgttt gctccacctt cttgactccc 300 
ccgaattcca gcacactggc ggcccgttac 360 

371 



<400> 146 

ggtcctccgt cctcttccca gaggtgtcgg 
caggatggcg agtagcagcg gctccaaggc 
ggtacggaag atcgggtctg gctccttcgg 
cggcgaggaa gtggcagtga agctagaatc 
cgagagcaag ctctataaga ttcttcaagg 
tggtcaggaa aaagactaca atgtactagt 

<210> 147 
<211> 355 
<212> DNA 
<213> Homo sapiens 



ggcttggccc cagcctccat cttcgtctct 60 
tgaattcatt gtcggaggga aatataaact 120 
ggacatctat ttggcgatca acatcaccaa 180 
tcagaaggcc aggcatcccc agttgctgta 240 
tggggttggc atcccccaca tacggtggta 300 
catggatctt ctgggaccta gcctc 355 



<400> 147 

ggtctgttac aaaatgaaga cagacaacac 
tactatgcac gtgctgtgat tttgaacata 
tgacttttta ggttggctga tccatcaatc 
ttgttaggag caaagctgac ctgaacagca 
tttttcccat aatatgggaa atattttaag 
acatttggta tatcttcatt ctttgaaaca 



aacatttact ctgtggagat atcctactca 60 
actcgtccca aaaacttgtc acgatcatcc 120 
ttgcactcaa ctgttacttc tttcccagtg 180 
accaatggct gtagataccc aacatgcagt 240 
tctatcattc cattatgagg ataaactgct 300 
caatctatcc ttggcactcc ttcag 355 



<210> 148 
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<211> 369 
<212> DNA 
<213> Homo sapiens 

<400> 148 



aggtctctct ccccctctcc ctctcctoro = 

caccttcctt catgatgtgg gaagagtgc? allT.t''^^^ agacatgctt acttcccctt 60 
agggagtgtg ccgagggct? ?tg!gaaggt t\ctcSa f^^'^^^^^ accgcatgag 120 

L= ~| f lT.T-il 

„ga gagaagt^ac a%c?c?- "^^^^ -agaggaa g_^ f 

<210> 149 
<211> 620 



<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> ^^2, 528, 559, 599 

<223> n « A/T,C or G 

<400> 149 



catgt^tJtc t^llltT, lttlTt?.t: t^T'T.' ^"^^-^^^ ^ttatagttt 60 
gccaatattt ccttatatc? atcclt^ uSSS" ^^^'^^^^^tta cctatactat 120 
SSt'SL" ,%--"^ata aggtaaaa^t gag^'^tSa t^ft'T -^^-tgcac 180 
SaaoS tttccaaata gaatggactt gg?Sgttaa a^^^^ atotgatcaa 240 

ataaggttaa aagttgttaa tgaccaaaca ttrtaaa^^^ ^ctaagga gaagaggaag 300 
tttcaagcct tcgaactatt taaaaaaa«^ "ctaaaaga aatgcaaaaa aaaagtttat ^fin 
gagaatttct cattaatatc c?gaa?Ja?t ^ttt.'^\" '^ctaaatgca tatcEu^ 420 
atgtctctaa gaaagtacta tttcat^^! catttcacta aggctcatgt tnactccoS Ian 
tttcccttaa gtgtSaant a^^taaaSj lltt^tl^^l tgccatan?t gggta^aSc 54? 
agggttaagg gtgttgggga ^^^^^ aaattttcct ctttttaaaa atLtttIS Ito 

<210> 150 
<2H> 371 
<212> DNA 
<213> Homo sapiens 



<400> 150 



ggtccgatca aaacctgcta cct 

gagcaaccag tatcacttcc ctottt^f^ ^^tttaotag tgccgataaa ctttctcaaa 60 
atgctgaaaa ccacctgg^c tgcatJSt. ^^'=^*=^^'= catctctttg ttctttgaac 120 
aaaatttaat tttagggSt cltllTtTtl t^.T'".'^ yaattcttt? ctc"a!a?g ilo 
atatgtgtaa ggtgaaattt atggtattt" ITtltaT ^ tgtagtatta ttatttcctt 240 

T~ -"IS a%\%\-- 3300 

<210> 151 

<211> 4655 

<212> DNA 

<213> Homo sapiens 



<400> 151 



gggacttgag ttctgttatc ttcti-^^rri-^ 

^t^«t,„ g^^^j.. 
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acatgttgta cctggaaaac aatgcccaga 
tggggctcct gaacagcatg gaccagcaga 
ataacacaga ccacgcgcag aacagcgtca 
ccaccttcga tgctctctct ccatcacccg 
cgcacagttt cgacgtgtcc ttccagcagt 
attccactga actgaagaaa ctctactgcc 
aggtgatgac cccacctcct cagggagctg 
ctgagcacgt cacggaggtg gtgaagcggt 
acgagggaca gattgcccct yctagtcatt 
agtatgtaga agatcccatc acaggaagac 
aggttggcac tgaattcacg acagtcttgt 
gagggatgaa ccgccgtcca attttaatca 
tcctgggccg acgctgcttt gaggcccgga 
cggatgaaga tagcatcaga aagcagcaag 
cgaagcgccc gtttcgtcag aacacacatg 
gatccccaga tgatgaactg gtatacttac 
tggtgaagat caaagagtcc ctggaactca 
cgtacaggca acagcaacag cagcagcacc 
agtctccatc ttcatatggt aacagctccc 
agctgccttc tgtgagccag cttatcaacc 
ccattcctga tggcatggga gccaacattc 
gagacatgaa tggactcagc cccacccagg 
cctcccactg cacaccccca cctccgtatc 
cgaggttggg ctgttcatca tgtctggact 
atcagattga gcattactcc atggatgatc 
gacatgcgat ctggaagggc atcctggacc 
ctcatctcct gcggacccca agcagtgcct 
ggggtgagcg tgttattgat gctgtgcgat 
cccgagatga gtggaatgac ttcaactttg 
gcatcaaaga ggagggggag tgagcctcac 
tgccagcccc ctaaaagcac tcctgcttaa 
ttcctcttgt ctgatttctt aggggaagga 
atctgacctg gcatctaatt ctgattctgg 
actgtagctt gccatggcta ggtagaagtg 
tgcagagatt tctcattgac ttttataaag 
atataaatgt ataaatatac agtatagatt 
aatgtaattt aaatgaaaga aaattgagtt 
ttttggatgg cttgtctata ctccttccct 
tagagcttaa tgctacatgt gagtgacgat 
ctaaatacat gccacatcaa acctttgagt 
aagactgtag atatgtattc ttttctcagt 
tctagtgatg atggttcacg ttggggtgat 
caaacgtcct ctttagtttt tggttgggaa 
ccagttcaaa aacacccgac gtcatgtatt 
taccagatac cttatcttac aatattgatt 
ttaaaactaa atttcactac tagattgact 
gaattctgat tgatttgatt gggatgaatg 
actgtctatt aatattcagg gtaaatagga 
cagtaagata tctcaatgaa ccataaattc 
aatattgttt ggtaaatgtt tcttttgttt 
tctataaaac tctgcatgta gaggcttgtt 
agtggtgatt tgaaaaatat aaaattatga 
actgtatcat tttctttttt aaccggtaag 
aacccagttt cccgtccatc tcccttaggg 
agcaagagat aagtctttca tggctgctgt 
tgaaactttg ggaaaacatg ttaatgacaa 
ttttttgcat gcatgcaaat gagctctgaa 
cattgcacat aagcttccat tttaatttta 



ctcaatttag tgagccacag tacacgaacc 180 

ttcagaacgg ctcctcgtcc accagtccct 240 
cggcgccctc gccctacgca cagcccagct 300 
ccatcccctc caacaccgac tacccaggcc 360 
cgagcaccgc caagtcggcc acctggacgt 420 
aaattgcaaa gacatgcccc atccagatca 480 
ttatccgcgc catgcctgtc tacaaaaaag 540 
gccccaacca tgagctgagc cgtgaattca 600 
tgattcgagt agaggggaac agccatgccc 660 
agagtgtgct ggtaccttat gagccacccc 720 
acaatttcat gtgtaacagc agttgtgttg 780 
ttgttactct ggaaaccaga gatgggcaag 840 
tctgtgcttg cccaggaaga gacaggaagg 900 
tttcggacag tacaaagaac ggtgatggta 960 
gtatccagat gacatccatc aagaaacgaa 1020 
cagtgagggg ccgtgagact tatgaaatgc 1080 
tgcagtacct tcttcagcac acaattgaaa 1140 
agcacttact tcagaaacag acctcaatac 1200 
cacctctgaa caaaatgaac agcatgaaca 1260 
ctcagcagcg caacgccctc actcctacaa 1320 
ccatgatggg cacccacatg ccaatggctg 1380 
cactccctcc cccactctcc atgccatcca 1440 
ccacagattg cagcattgtc agtttcttag 1500 
atttcacgac ccaggggctg accaccatct 1560 
tggcaagtct gaaaatccct gagcaatttc 1620 
accggcagct ccacgaattc tcctcccctt 1680 
ctacagtcag tgtgggctcc agtgagaccc 1740 
tcaccctccg ccagaccatc tctttcccac 1800 
acatggatgc tcgccgcaat aagcaacagc 1860 
catgtgagct cttcctatcc ctctcctaac 1920 
tcttcaaagc cttctcccta gctcctcccc 1980 
gaagtaagag gcttacttct taccctaacc 2040 
ctttaagcct tcaaaactat agcttgcaga 2100 
agcaaaaaag agttgggtgt ctccttaagc 2160 
catgttcacc cttatagtct aagactatat 2220 
tttgggtggg gggcattgag tattgtttaa 2280 
gcacttattg accatttttt ' aatttacttg 2340 
taaggggtat catgtatggt gataggtatc 2400 
gatgtacaga ttctttcagt tctttggatt 2460 
agatccattt ccattgctta ttatgtaggt 2520 
gttggtatat tttatattac tgacatttct 2580 
ttaatccagt tataagaaga agttcatgtc 2640 
tgaggaaaat tcttaaaagg cccatagcag 2700 
tgagcatatc agtaaccccc ttaaatttaa 2760 
gggaaaacat ttgctgccat tacagaggta 2820 
aactcaaata cacatttgct actgttgtaa 2880 
ccatctatct agttctaaca gtgaagtttt 2940 
atcattcaga aatgttgagt ctgtactaaa 3000 
aactttgtaa aaatcttttg aagcatagat 3060 
ggtaaatgtt tcytttaaag accctcctat 3120 
tacctttctc tctctaaggt ttacaatagg 3180 
gattggtttt cctgtggcat aaattgcatc 3240 
agtttcagtt tgttggaaag taactgtgag 3300 
actacccata gacatgaaag gtccccacag 3360 
tgcttaaacc acttaaacga agagttccct 3420 
tattccagat ctttcagaaa tataacacat 3480 
atcttcccat gcattctggt caagggctgt 3540 
aagtgcaaaa gggccagcgt ggctctaaaa 3600 
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SS?ga^ ?!t1SS llTtTtt ?.HT'' *----"t 3660 

accttttttt atcgtttttg tat"tca?^ aaSacSt cacacttgaa 3720 

aataagaaat aatgctacaa ttttaagag? aaa^^^f "^^taagaa taccacatca 3780 
tttttttaaa attttgtatg ttaaagagS tllTc^^a ^^^f^"" ttttttatta 3840 
ttaaatggta ataagcactg taaacftltg caSa^!? ^^^^^^^^t tttgttgtac 3900 
aggggaagaa tgaaagctgt tccttgg^co JJS^^^ gcagctttgc aaacccatta 3960 
tgctgagggt ttgaataaac ctaggj^^ttc ctalctlTJ ^^^^^^5°^ tcccttactt 4020 
tagggccttg gaaatccctg tactgtgtct caJgSt^?. ^^f^^att caggtaacac 4080 
ccccttactg gcttacctcc tcatggcagc c?^c?Scc? f ^^^agcca aagcgaggca 4140 
gtaaggggta aaaggatagt aagca?agla accactaoaa l^^^tt^ agtagccagg 4200 
tgggcctca gctcaatgca gttagctgaa gStgafaa JlfTltt ^^^gagitct 4260 
ataaacagaa atggaaagca gagttttcat taaatStf^ Stttttgttt ggagacgttt 4320 
aatcccctaa aataacagta tgtgggaJat taa^J^^ tacctttttt ttttcttggt 4380 
tttttataat tgtacaaaat taaJSaaJg tllllf^tt ^^^^^^^"t ttttctatU 4440 
tcaaaaggta ttatacatgt gatacattt? «aaanS ^atatgcttt attaatgttt 4500 
ttctgttatg ggottttggg LgccJaaaa ^^^ff^ gttgcttgtc ttctgg?act 4560 
acatgcaata aaatttalff ItHHttH aS ^^^^^^^tt ^tttgccagg 4620 

<210> 152 ^^^^ 

<211> 586 

<212> PRT 

<213> Homo sapiens 

<400> 152 

>f L» TV. ^„ ^„ ^ 

- - - ^„ , Z ^ ^ U 

CI. s„ 3„ . ^„ 30__ 

Val Thr Ala Pro Ser Pro Ivr ai\. « 

50 '^''^ Gin Pro Ser Ser Thr Phe Asp Ala 

ser Pro Ser Pro Ala Xle Pro Ser Asn ^ Sp p., 
His ser P.e Asp Val Ser P.e Oln Oln Ser S^fr ..r Ala ..s Ser IL 
- .rp ^ 3er .r .1. .eu .,3 .eu c.s 01„ ^ Ala 

.Hr gs Pro Xle 01„ Xle ..3 Met ..r Pro Pro III .i„ 
-a Val Xle Arg Ala Met Pro vfj .,r ..3 .,3 Ala III His Val Xhr 

Val val .,3 Arg C.s Pro Asn His .1. . ^ 
- .1, oin Xle Ala Pro Ser Ser His .eu III Arg Val Olu 01. 

ser His Ala Gin Tyr Val Glu Asp Pro xle Thr n . , 

180 ^ 1 f° ^hr Gly Arg Gin Ser Val 

Leu Val Pro Tyr Glu Pro Pro ri,. ,7 , ^, 

195 '^'^^ JJ" <51y Thr Glu Phe Thr Thr Val 

Leu Tyr Asn Phe Met Cys Asn sir- c ^ ^05 

S ""^^ ''^y Asn Arg 

Arg Pro He Leu lie ii^ vt^ . ^20 ^ 

230 ^"'^ "''^ ^5 *3p Gly Gin Val 

Leu Gly Arg Arg Cys Phe Glu Ala Arg lie cll Al. r o ^40 
245 ^ ^a Cys Pro Gly Arg 

Asp Arg Lys Ala Asp Glu Aso S^r- ti= » r 255 

P ^xu Asp Ser Xle Arg Lys Gin Gin Val Ser Asp 
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260 



Ser 


Thr 


Lys 


Asn 


Gly Asp 


Gly Thr 






275 










280 


His Gly 


He 


Gin 


Met 


Thr 


Ser 


He 




290 










295 




Glu 


Leu 


Val 


Tyr 


Leu 


Pro 


Val Arg 


305 










310 






Val Lys 


He 


Lys 


Glu 


Ser 


Leu 


Glu 










325 








Thr 


lie 


Glu 


Thr 


Tyr Arg 


Gin 


Gin 








340 










Leu 


Gin 


Lys 


Gin 


Thr 


Ser 


He 


Gin 






355 










360 


Ser 


Pro 


Pro 


Leu 


Asn Lys 


Met 


Asn 




370 










375 




Ser 


Gin 


Leu 


He 


Asn 


Pro 


Gin 


Gin 


385 










390 






lie 


Pro 


Asp 


Gly 


Met 


Gly 


Ala 


Asn 










405 








Pro 


Met 


Ala 


Gly 


Asp Met 


Asn 


Gly 








420 










Pro 


Pro 


Leu 


Ser 


Met 


Pro 


Ser 


Thr 






435 










440 


Tyr 


Pro 


Thr 


Asp 


Cys 


Ser 


He 


Val 




450 










455 




Ser 


Ser 


Cys 


Leu 


Asp 


Tyr 


Phe 


Thr 


465 










470 






Gin 


He 


Glu 


His 


Tyr 


Ser 


Met 


Asp 










485 








Glu 


Gin 


Phe 


Arg 


His 


Ala 


He 


Trp 








500 










Leu 


His 


Glu 


Phe 


Ser 


Ser 


Pro 


Ser 






515 










520 


Ala 


Ser 


Thr 


Val 


Ser 


Val 


Gly Ser 




530 










535 




He Asp 


Ala 


Val 


Arg Phe 


Thr 


Leu 


545 










550 






Arg Asp 


Glu 


Trp 


Asn Asp 


Phe 


Asn 










565 








Lys 


Gin 


Gin 


Arg 


He 


Lys 


Glu 


Glu 



580 



265 




270 






Lys 


Arg 


Pro Phe Arg Gin 


Asn 


Thr 






285 






Lys 


Lys 


Arg Arg Ser Pro 


Asp 


Asp 






300 






Gly 


Arg 


Glu Thr Tyr Glu 


Met 


Leu 






315 




320 


Leu 


Met 


Gin Tyr Leu Leu 


Gin 


His 




330 




335 




Gin 


Gin 


Gin Gin His Gin 


His 


Leu 


345 




350 






Ser 


Pro 


Ser Ser Tyr Gly 


Asn 


Ser 






365 






Ser 


Met 


Asn Lys Leu Pro 


Ser 


Val 






380 






Arg 


Asn 


Ala Leu Thr Pro 


Thr 


Thr 






395 




400 


He 


Pro 


Met Met Gly Thr 


His 


Met 




410 




415 




Leu 


Ser 


Pro Thr Gin Ala 


Leu 


Pro 


425 




430 






Ser 


His 


Cys Thr Pro Pro 


Pro 


Pro 






445 






Ser 


Phe 


Leu Ala Arg Leu 


Gly 


Cys 






460 






Thr 


Gin 


Gly Leu Thr Thr 


He 


Tyr 






475 




480 


Asp 


Leu 


Ala Ser Leu Lys 


He 


Pro 




490 




495 




Lvs 


Glv 


He Leu Asp His 


Arg 


Gin 


505 




510 






His 


Leu 


Leu Arg Thr Pro 


Ser 


Ser 






525 






Ser 


Glu 


Thr Arg Gly Glu 


Arg 


Val 






540 






Arg 


Gin 


Thr He Ser Phe 


Pro 


Pro 






555 




560 


Phe 


Asp 


Met Asp Ala Arg 


Arg 


Asn 




570 




575 




Gly 


Glu 








585 











<210> 153 

<211> 2007 

<212> DNA 

<213> Homo sapiens 

<400> 153 

gaattcgtcg ctgctccagg gaaagttctg 
acatggccag caagaaagta attacagtgt 
tggccagggc aattttggag agcaaaaaat 
cttgaccaaa tgccctggag ctccagcgcc 
atgataaagc atcggtggac agtgccttaa 
acttctggga ccctctcaac caagataagg 
ccgccaagca cctgggtctg aagcacgtgg 
tgacggatgg caagctggag gtgccgcact 
tctggtccat tggcatcccc atgaccagtg 



ttactccact gactctctct tttcctgata 60 
ttggagcaac aggagctcaa ggtggctctg 120 
ttgcagtgag agcagtgacc agggatgtga 180 
ttggagctga ggtggtcaaa ggtgacctga 240 
aaggtgtcta tggggccttc ttggtgacca 300 
aagtgtgtcg ggggaagctg gtggcagact 360 
tgtacagcgg cctggagaac gtcaagcgac 420 
ttgacagcaa gggcgaggtg gaggagtact 480 
tccgcgtggc ggcctacttt gaaaactttc 540 
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tcgcggcgtg gcggcccgtg aaaccctft-^ =4-^ 

tgggagatgt accaatggat ggtatctSa tSS^f^!^ ^tacaccttg gctgtaccga 600 
tttttaattc tccagaggaa tttttagS aSo^^f^* tggagcagcc gtctctagca 660 
caatacagca atatgctgat gtttjgtcca llln^^t^^'' ^^'^^Stgca gaagcacLa 720 
agattacccc ggaagctttc gagaaaSa ttff gaaagaagtc cgagatgcaa 780 

tgtgtcgttt ctatgaaatg aagccagaoc aa^^ff agcaaaggaa atagccaata 840 
ccaaagtcaa aagcttcagc cag???Sct llaTallT^ tctcacccac caaltaaatc loo 
agaaaatcag ctgttcagat aggcctctac ^^^'^^^ gggagccttc aagggcatgt 960 
ttcctcttta cggcacaaca ttSt^S^f accacacagc ctctttcctc tctgatcctt lo?n 
caccgaagga tttcctgcgg tcTollllll S"'"" tggaatgcaa ttgStgcL 1080 
cggtaatttg attcacattt aacttgctag SStS f."^^ tgataggaca 140 
aaaatgagaa gcctcggaac ttggagcttc ^ctS^o! ^f^^^^^^""^ actgtttggt 1200 
SSJ' f -^^55tg ag?L?ttcJ agcSa Ttatl^tr^ ^^^agattat 1260 
tccagttttc tcaactgcat tgcaaaattr ^^IZT ? ctgaaattcc cctaggcagc 1320 
aaaaatgaac atctttg'tag a^a^mlct gg^JStl llTt^'T "*-'tt?: iMS 
agcattggaa atgctaaaat tcagttttgc Itlttlttl^ gtgttcaatg aacaagcaca 1440 
ttcatgaagt catctattga gccaccaUc Stt^^f ^ gaagtttatt ttctgactca 1500 
catttatcca ttctgcaaac ttttcttaaa ^^^^^^^tattaattc cttgatcctt 1560 
tcttcattcc tatgtgtttt cttatcaa;^ SatS t^^^^^^^^^ tgt^gacttc 1620 
ctgtggttgg gttcaagtca tgccagggcc a^aaarS^ tcgaaaggct cctttccagt 1680 
caaaatccag gggatctgca gtgggga?ca tctcctcgtt tagctctagg 1740 

agggtaggga tgtggaaaga Jaag^glS ?2Scca T^^^^'^"" aggcctgtS 18oS 
tggcttagca ttttctacat catattgtS SSctta^t f tctatatctc 1860 

gtgagtgact aacagtcatc tttate^r^ ^'^^^^'^^tt tgctagtttt cttcottact 1920 
gttgattgac taaaaaaaaa aaa^^S'^ tgcctggtac ataataagtg atcaataaS I'sS 

<210> 154 
<211> 2148 



<212> DNA 
<213> Homo sapiens 

<400> 154 



^ggS^g^ mZT^ It-l-r 60 

cf= ~ S.tc- a-Xlf lis 

rt^^t^^T ^*°59tggac agtgccS UlTalll^^ ^t^gtcaaa ggtgacctga 240 
cacctgggtc tgaagcacgt ggtgtacJac aao^?f tggtggcaga ctccgccaag 300 
ggcaagctgg aggtgccgca c?t?gacagc aSSal^ ^cgtcaagcg actgacgga? 360 
attggcatcc ccatgaccag tgtc^gc^g gcgScta!? '^^^^^agta cttctggtcc 420 
tggcggcccg tgaaagcctc taatal^r^t gcggcctact ttgaaaactt tctcgcggcq 480 
gtaccaatgg atggt^tctc tS^SgS attaaf'"' ^^^^^^tacc gatgggaga? tli 
tctccagagg aatttttagg caaggccgS aaS.^''^^ ccgtotctag catttttaat 600 
caatatgctg atgttttgtc caaggcSta aaaf cagaagcact aacaatacag 660 

tgtgctatag atgaccagaa aacagtggaa JaaS^^ tccgagatgc aaagactatc 720 
tggtccttga gggaacatga ccatgtS Ittl^ tggaagacgt gggcttgagt 780 
ctaattctgg aataaacacg acaalccS aaca^f^' <=atcaagaag gctggcct^ 840 
ctgcctctat octtgattac cccgga^? ?Saf ' ^aaggaggca aattctggi? 900 
gaaatagcca atatgtgtcg tttcS?S aSaf tgggattccc tgcagcaaag 960 
caccaactaa atcccaaagt caaaagcttc agcSaS^ f^gagatgt caatctcacf 1020 
ttcaagggca tgtagaaaat cagctStcT gataaaJ^^ tctcagagaa ccagggagcc 1080 
ctctctgatc cttttcctct ttac(T„n=^ ^ataggcctc tgcaccacac agcctctttc liln 
caattgtttg caacaccgaa ggaS?ccta ca^""''^ tgacagaaca t|ctggaatg 12"? 
tggtgatagg acacggtaat ItHtltfcl ttfaJt^^'' ttcagtagga agcac?gca? ilTo 
acaactgttt ggtaaaatga gafgcctcaa aJct^ tagttagtga taagggtggt 1320 
gagggcagat tatactggga fttctSw f^^^'^ ttctctccta ccactaatgg 1380 

tcccctaggc agctccag?t ^"tcaac^a S?^^*^^" tcaagcccta atgctgaaf? 144 J 
acttttaact taaaaaaatg aa2?cm1 SgaSm "^^^^^^ -^^^Lagt Ittl 

g ^agagaattt tctggggaac atggtgttca 1560 
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atgaacaagc acaagcattg gaaatgctaa aattcagttt tgcctcaaga ttggaagttt 1620 

attttctgac tcattcatga agtcatctat tgagccacca ttcaattatt catctattaa 1680 

ttccttgatc cttcatttat ccattctgca aacttttctt gagcaccagc acgggtggcc 17 40 

atttgtggac ttctcttcat tcctatgtgt tttcttatca aagtgatcca ctctcgaaag 1800 

gctcctttcc agtctgtggt tgggttcaag tcatgccagg gccagggggc ccatctcctc 1860 

gtttagctct aggcaaaatc caggggatct gcagtgggga gcgggggcag gaagctggag 1920 

ggaaggcctg tgaagggtag ggatgtggaa agacaaggtg acagaaggac ccaataggac 1980 

ctttctatat ctctggctta gcattttcta catcatattg taatcgtctt atttgctagt 2040 

tttcttcctt actgtgagtg actaacagtc atctttatcc cagtgcctgg tacataataa 2100 

gtgatcaata aatgttgatt gactaaatga aaaaaaaaaa aaaaaaaa 2148 

<210> 155 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 155 



Met Thr 


Ser Val Arg 


Val 


Ala 


Ala 


Tyr 


Phe Glu Asn 


Phe 


Leu Ala Ala 


1 


5 










10 






15 


Trp Arg 


Pro Val Lys 


Ala 


Ser 


Asp 


Gly 


Asp Tyr Tyr 


Thr 


Leu Ala Val 




20 








25 








30 


Pro Met 


Gly Asp Val 


Pro 


Met 


Asp 


Gly 


He Ser 


Val 


Ala 


Asp He Gly 




35 






40 








45 


Ala Ala 


Val Ser Ser 


He 


Phe 


Asn 


Ser 


Pro Glu 


Glu 


Phe 


Leu Gly Lys 


50 






55 








60 




Ala Val 


Gly Leu Ser 


Ala 


Glu 


Ala 


Leu 


Thr He 


Gin 


Gin 


Tvr Ala Asn 


65 




70 








75 






80 


Val Leu 


Ser Lys Ala 


Leu 


Gly 


Lys 


Glu 


Val Arg Asp 


Ala 


Lys He Thr 




85 










90 








Pro Glu 


Ala Phe Glu 


Lys 


Leu 


Gly 


Phe 


Pro Ala 


Ala 


Lys 


Glu He Ala 




100 








105 






110 


Asn Met 


Cys Arg Phe 


Tyr 


Glu 


Met 


Lys 


Pro Asp Arg 


Asp 


Val Asn Leu 




115 






120 








125 




Thr His 


Gin Leu Asn 


Pro 


Lys 


Val 


Lys 


Ser Phe 


Ser 


Gin 


Phe He Ser 


130 






135 








140 






Glu Asn 


Gin Gly Ala 


Phe 


Lys 


Gly 


Met 










145 




150 
















<210> 156 


















<211> 128 


















<212> PRT 


















<213> Homo sapiens 


















<400> 156 


















Met Thr 


Ser Val Arg 


Val 


Ala 


Ala 


Tyr 


Phe Glu 


Asn 


Phe 


Leu Ala Ala 


1 


5 










10 






15 


Trp Arg 


Pro Val Lys 


Ala 


Ser 


Asp 


Gly 


Asp Tyr 


Tyr 


Thr 


Leu Ala Val 




20 








25 








30 


Pro Met 


Gly Asp Val 


Pro 


Met 


Asp 


Gly 


He Ser 


Val 


Ala 


Asp He Gly 




35 






40 








45 


Ala Ala 


Val Ser Ser 


He 


Phe 


Asn 


Ser 


Pro Glu 


Glu 


Phe 


Leu Gly Lys 


50 






55 








60 




Ala Val 


Gly Leu Ser 


Ala 


Glu 


Ala 


Leu 


Thr He 


Gin 


Gin 


Tyr Ala Asp 


65 




70 








75 






80 


Val Leu 


Ser Lys Ala 


Leu 


Gly 


Lys 


Glu 


Val Arg Asp 


Ala 


Lys Thr He 




85 










90 






95 


Cys Ala 


lie Asp Asp 


Gin 


Lys 


Thr 


Val 


Glu Glu Gly 


Phe 


Met Glu Asp 
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105 



Val GX, .eu Ser Ser .eu J., ..^ h.s .sp Ms Va. 

125 



<210> 157 
<211> 424 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc^feature 
<222> 320, 322 
<223> n A,T,C or G 

<400> 157 



iniiiiiTt ixTtXi &" LT^^*-^ - 

aattcagtca ccactgttat aSJccttct ™ tttctctcca acctcottca 120 

tattagattt ccttgLtgc ^a^^t^tg SgScta I'T^^^^^ aaggctgcga 180 
agaggaagga gaaaactgca tcaiiactt? acaJaStS ^f"^*^^?^^^ aggtgagagg 240 
agcccagaaa cttctctgcn gnatctggct ?S?caS?a S^^'^^ ' ' ttccccgaaa 300 
ccccagccat cgagtcagtt tgtgcc??tg aat\^Me^ g^SgS^ ?SSt'S^ JIS 



<210> 158 
<211> 2099 
<212> DNA 

<213> Homo sapiens 
<400> 158 



cc~g g^g™ lllllZlt ace cgacgagagc 60 

ccgcgcagag cccgcgccag ggccgccggc ogTaaaalta tt^^^^"^^^ gggcgccgcc 120 
aaggcacttc ctgtcggtga Igaagacctg JctccS JJ^^^^^^^g caggcaccag 180 
caaacggggc tgacctccct tcct^gggag lalaalalS ^^^^^^rcatc ctgtgttttg 240 
agaagatctg gctaaacaat ttctg?Itgg cgSaaaaS ^ff^gaagga aaagaagtac 300 
ttcatgcatc tttaattcaa tttgaata?? ™acaao« f"°*«^"=tt gtacgccctc 360 
attgacattc gtatcatcac tgtgcaccat JggStSaa ccgagcaaag 420 

aaggaggtct gaaaccctcg cIglgggaJc Sccctca? tof^.'^''^^* Sgggtaggag 480 
gcagtcgttg gaaacaggac tcagggltaa la^lTfr. l°^^^^'39tc tgaaacactg 540 
actttcatcg ggggtgtcaa caaJcactcc acSgcIS af'^.^T ^^^^^^gcac 600 
atotttattt tccgagtcat gatcctcgtg rtqactaccr ^^^^^^^^'3 gatcacagtc 660 
caagaggact togtctgcaa cacactgcaa fcaaallart ^^^^f^^^^^ gggtgacgag 720 
tttttcccgg tgtcccacat ccggctgtj; gSSc™ g^gtg ctatgaccac 780 

gcgctgctgg tggccatgca tgtggcctS ll^al^^ tgatcttcgt ctccacccca 840 
cgaggagaga agaggaatga t^toSgac SSaaca caagttcagg 900 

atagaggggt cgctgtggtg gacgtaSSc agcaSaS "f^^^^^^^ gaaggttcgg 960 
gcagccttta tgtatgtgtt ttactt^tt tlctltaa^t ttttccgaat catctttgaa 1020 
aaatgtggga ttgacccctg ccccaacc" rt?gac?a2 t^tTt'''' <=tgggtgttg 1080 
aagaccgtgt ttaccatttt tatoattfrc^ ^l^l^ I ^^^tttctag gccaacagag 1140 
gcagagttgt gctacctgct gSgaaaJS tatltlf^^ tttgcatgct gcttaac^tg 12oJ 
caaaaaaatc accccaaLa ?gc?StJaaa a!^"^^^^ gatcaaagag agcacaglcg 1260 
atttcagata gtggtcaaaa tgScaca ^^?f^^^^5<= agaatgaaat gaatgagctg 1320 
tgtagctgcg tcataaggag actt'tgJct tTaclXl f^^*^^"^ caaggtfaaf lH'o 
ttllfj^^^ cctgaagagt ttgtaaatga cStca^aaf l^'^t ""^^ cctgaaagtt 1440 
tttttgtagg atacttgctc ca?tcataca caacrtaat^ f f ttgagttaac 1500 
aaaacaagag actgcttgac aaaggagcat tgSScact tccatctctg 1560 

yy gcat rgcagtcact ttgacaggtt ccttttaagt 1620 
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ggactctctg acaaagtggg tactttctga aaatttatat aactgttgtt gataaggaac 1680 

atttatccag gaattgatac gtttattagg aaaagatatt tttataggct tggatgtttt 1740 

tagttctgac tttgaattta tataaagtat ttttataatg actggtcttc cttacctgga 1800 

aaaacatgcg atgttagttt tagaattaca ccacaagtat ctaaatttgg aacttacaaa 1860 

gggtctatct tgtaaatatt gttttgcatt gtctgttggc aaatttgtga actgtcatga 1920 

tacgcttaag gtggaaagtg ttcattgcac aatatatttt tactgctttc tgaatgtaga 1980 

cggaacagtg tggaagcaga aggctttttt aactcatccg tttgccaatc attgcaaaca 2040 

actgaaatgt ggatgtgatt gcctcaataa agctcgtccc cattgcttaa aaaaaaaaa 2099 

<210> 159 

<211> 291 

<212> PRT 

<213> Homo sapiens 



<400> 159 



Met Asp 


Trp 


Gly 


Thr 


Leu 


His 


Thr 


Phe 


He 


Gly 


Gly 


Val 


Asn Lys 


His 


1 






5 










10 








15 




Ser Thr 


Ser 


He 


Gly 


Lvs 


Val 


Tro 


He 


Thr 


Val 


He 


Phe 


He Phe 


Arg 






20 ' 










25 










30 




Val Met 


He 


Leu 


Val 


Val 


Ala 


Ala 


Gin 


Glu 


Val 


Tro 


Glv 


AsD Glu 


Gin 




35 










40 








45 




Glu Asp 


Phe 


Val 


Cys 


Asn 


Thr 


Leu 


Gin 


Pro 


Glv 


Cvs 


Lys 


Asn Val 


Cys 


50 










55 










60 








Tyr Asp 


His 


Phe 


Phe 


Pro 


Val 


Ser 


His 


He 


Arg 


Leu 


Trp 


Ala Leu 


Gin 


65 








70 










75 








80 


Leu lie 


Phe 


Val 


Ser 


Thr 


Pro 


Ala 


Leu 


Leu 


Val 


Ala 


Met 


His Val 


Ala 








85 










90 








95 




Tyr Tyr 


Arg 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 


Arg 


Arg 


Gly 


Glu Lys 


Arg 






100 










105 










110 




Asn Asp 


Phe 


Lys 


Asp 


He 


Glu 


Asp 


He 


Lys 


Lys 


Gin 


Lys 


Val Arg 


He 




115 










120 










125 






Glu Gly 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 


Ser 


He 


Phe 


Phe Arg 


He 


130 










135 










140 








He Phe 


Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 


Tyr 


Phe 


Leu 


Tyr Asn 


Gly 


145 








150 










155 








160 


Tyr His 


Leu 


Pro 


Trp 


Val 


Leu 


Lys 


Cys 


Gly 


He 


Asp 


Pro 


Cys Pro 


Asn 








165 










170 








175 




Leu Val 


Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 


Glu 


Lys 


Thr 


Val Phe 


Thr 






180 










185 










190 




He Phe 


Met 


He 


Ser 


Ala 


Ser 


Val 


He 


Cys 


Met 


Leu 


Leu 


Asn Val 


Ala 




195 










200 










205 






Glu Leu 


Cys. 


Tyr 


Leu 


Leu 


Leu 


Lys 


Val 


Cys 


Phe 


Arg 


Arg 


Ser Lys 


Arg 


210 










215 










220 








Ala Gin 


Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 


Ala 


Leu 


Lys 


Glu Ser 


Lys 


225 








230 










235 






240 


Gin Asn 


Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 


Ser 


Gly 


Gin 


Asn Ala 


He 








245 










250 








255 




Thr Gly 


Ser 


Gin 


Ala 


Lys 


His 


Phe 


Lys 


Val 


Lys 


Cys 


Ser 


Cys Val 


He 






260 










265 










270 




Arg Arg 


Leu 


Leu 


Ser 


Ser 


Pro 


Glu 


Gly 


Asn 


Thr 


Asn 


Leu 


Lys Val 


Pro 



275 280 285 



Ser Val Ala 
290 



<210> 160 
<211> 3951 
<212> DNA 
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<213> Homo sapiens 
<400> 160 



PCT/US01/2106S 



tctgcatcca tattgaaaac ctgacacaat 

gaggcttctc tacaacatga cccaalSaJ JaSaca^at f f ^f^'^t gagtgaactg 60 
tgtgactctc ctggttgcct taagttSg! artcccaSc f^^^^^^gca acctgaagtt 120 
tcaagacaat gggtataatg gattgctcat ta^rr^r f ^^^ggagctg gagtacagct 180 
gaacctcatc tcaaacatta fggStgat ^aotaaaac^ ^'.^f^^'^^ ctgagaa?ca 240 
taccaagaga agagtatttt tcagaaaJat aaSSt??! ^^^^tttacc tatttaatgc 300 
taataataao agcaaaataa aacaagaatc aSa^^^ ^tacctgcca catggaaago 360 
ctggtatggg gcacatggag atgaSatJ SacSaca! P^^*^t<=^ tagtgactga 420 
gggaaaatac attcatttca cacctaatt? cStaSa!^^ ^^^^^^^f?^ gtggaaaaga 480 
cggatcacga ggccgagtgt ttgtccatga aWc^cac T^^^l""^^ cagctggcta 540 
tgagtataac aatgacaaac ctttctacS laltlllrt^ ctccgttggg gtgtgttcga 600 
gtgttcatct gacatoaoag gcattttS rt^J^!^ aatcaaatta aagtgacaag 660 
ctgtattatt agtaagctt? t^^igaa^ TtfclcTtt f"^- cccaagaaaa 720 
tgcaactgca tcaataatgt tcatgoaaaa ttf^f^! ^ atctacaata gcacccaaaa 780 
•aagtacccac aaccaagaag caccSacrt ^^''^ Stggttgaat tttgtaatgc 840 

atgggatgta atcacagac? c?gctgac?J tc^f ^'^''^^ atgtgcagcc tca^aagtgc 900 
gcttccacct cctcccacat tc^bgS^L agaa^!^!^" "^cccatga acgggactga 960 
gctggatgtg tccagcaaga tggcagagS SlllTjf ^ff^^^figg tctgtttagt 1020 
agaattttat ttgatgcaga t?gttga2? ^JataSS^ ^ttcaactac aacaagccgc 1080 
cagcaaagga gagatcagag cccag^t^a ccaaatS!^ gtgggcattg ccagtttcga 1140 
gctggtttca tatctgccca ccaclgtato aaot^! agcaatgatg atcgaaagtt 1200 
gcttaagaaa ggatttgagg tgattSa!! ^^^^^ gacatcagca tttgttcagg 1260 
gatattagtg accagcggag a^gaSS toUa^l^^t ^^^'^^^^^ gctltgtga? 1320 
cagtggttca acaattcact ccattgc?ct gggSaSt ^r"^"='=^ ctgtgctcag 138^ 
attatcacgt cttacaggag gtttaaaotf ^^f^^f gcagccccaa atctggagga 1440 
catgattgat gctttcagta ga^StSc iTaJllt"" ^^*^^^^<=- actccaa^ag 15oS 
tcagcttgaa agtacaggtg aaaatrtSa acnt^^^^'f gacattttcc agcaacatat 1560 
tgtggataat actgtgggca acgJcS gSSjatt Tr'l'^^'^' aoacagtgac 1620 
tcctgagatt atattatttg atcctgatgg SaaJaaS^ f'=5^9gcagg ccagtggtcc 1680 
caatctaact tttcggacag ctagtcttto tacacaaata attttatcac 1740 

gacttacacc ctgaacaata cccSc"S IclToTaat T^''^^^^^ ctgggcactg 1800 
tcgcgcctcc aactcagctg tgccccca^c Itlt ctgaaagtga cagtgacctc 1860 

cctccatttt cctcatcctg tgatqatt?a tlT^ IT ^^^tttgtgg aaagagacag 1920 
tcttaatgcc actgtcactg ccacaStaa °^^*5tg aaacagggat tttatccca? 1980 
actccttgat gatggagcag gtgcSatIt ?;t"a:a:2 I't'^^T' ttacgctgag 2040 
ttttttctcc tttgctgcaa atlgtaga?a S5c??aaa! actcgaggta 2100 

cagcataagc accccagccc actctattoo "^^^^^^^^ gtgcatgtca atcactctcc 2160 
cacagcaaac ggtaatJttc IgatgSSc tSa^a f^^^^^^^ taccaggtta 2220 
ggagcgaaag tggggcttta gccaaat~^ ^^^^^^f^aaa tcagtaggoa gaaatgagga 2280 
tccagctggc ccccLcctg atrtaS?^. ctcaggaggc tocttttcag tgctgggart 2340 
aaaagtagaa gaggaattgf Stlt^tt^J SalScct ^"^"^^^ 4aaStS 
ccaggctaca agotatgaaa taagaatgag ?aaaIatcS ^^^^^f^^^^t ttgatcaggg 2460 
aat^ff^^'^*^ attttagtaa atacatcSa gSgJaa^cct !f ^^^^^'^^ aagatgactt 2520 
gatatttacg ttctcaccco aaatttcc^c JaataaaS "^^'^^^S^tg gcatcaggga 2580 
ItTtttlT ^^^<=^-a5- tttatgtSc aataSagca f^'^f'''^'' ^aaatggaga 2640 
gtctgctgta tctaacattg cccaggcgcc ?ctat?S ^^^^tatagga actccttaca 2700 
acctgccaga gattatctta tattqal^aa ^^ff^^ ^^'^'^^caatt ctgatcctgt 2760 
catttgcctt attatagttg tgacIc^Sf L'cttSc tgataggalt 2820 

gaaagagaat ggaacaaaat tIttataaS Saata^ccf^ ^ggaaaaaga gagcagacaa 2880 
taagacccat ggccttcgac tacaaaaaca tacJaacaaa ^="<=° ttcttagata 2940 
gtattaaaat gcattgagtt tttol-an^=^ tactaacaaa gtcaaattaa catcaaaact 3000 
tggggj:!;: ttaSaaa" ctSSt "^^^^^^ gtagatcaSc ITo 

Sa\\1;- S~ • 
tctgca.at aac ^ gt^a-^:? ..agg^^^^^^^ £ 
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ctccttatct gtgcagaaca ggttgcttgt 
tatgaagccc ctaatgcaaa gctctttacc 
gaaatctcac tgctaatgct cagagatctt 
atgggtatta cctttgtctc ttcataccgg 
gtttgtaagt ttctactccc atcaaagcag 
atgatagtta tagcccttat aatgccttaa 
gttttaatac atatatgaac atatagtttt 
ggagatacta acctttggaa atgattagct 
taatcctttc tccatcaaga gttacttacc 
caaggaaata aaaatcatct ttcatcttta 
gattatcgaa caataaaatc atttgccttt 

<210> 161 
<211> 943 
<212> PRT 
<213> Homo sapiens 



<400> 161 



Met 


Thr 


Gin Arg 


Ser 


He 


Ala 


Glv 


Pro 


He 


Cys 


Asn 


Leu 


Lys 


Phe Val 


1 






5 










10 








15 


Thr 


Leu 


Leu Val 


Ala 


Leu 


Ser 


Ser 


Glu 


Leu 


Pro 


Phe 




Glv 


AIa Glv 






20 










25 












Val 


Gin 


Leu Gin 


ASD 


Asn 


Glv 


Tvr 


Asn Gly 


Leu 


Leu 


He 


Ala 


noil 






35 








40 










45 






Pro 


Gin 


Val Pro 


Glu 


Asn 


Gin 


Asn 


Leu 


He 


Ser 


noil 






wx u L. 




50 








55 










60 






lie 


Thr 


Glu Ala 


Ser 


Phe 


Tvr 


Leu 


Phe 


*Asn 


Ala 


TVi IT 

XllJL 


djys> 


Arrr 


21r>rr Val 
rix^ VdX 


65 








70 










75 






ou 


Phe 


Phe 


Arg Asn 


lie 


Lys 


He 


Leu 


He 


Pro 


Ala 


Thr 


Tro 


Lys 


Ala Asn 








85 










90 










95 


Asn 


Asn 


Ser Lys 


He 


Lys 


Gin 


Glu 


Ser 


Tyr 


Glu 


Lys 


Ala 


Asn 


Val He 






100 










105 










110 




Val 


Thr 


Asp Trp 


Tyr 


Gly 


Ala 


His 


Gly 


Asp 


Asp 


Pro 


Tyr 


Thr 


Leu Gin 






115 








120 










125 






Tyr 


Arg 


Gly Cys 


Gly 


Lys 


Glu 


Gly 


Lys 


Tyr 


He 


His 


Phe 


Thr 


Pro Asn 




130 








135 










140 








Phe 


Leu 


Leu Asn 


Asp 


Asn 


Leu 


Thr- 


Ala 


Gly 


Tyr 


Gly 


Ser 


Arg 


Gly Arg 


145 








150 










155 








160 


Val 


Phe 


Val His 


Glu 


Trp 


Ala 


His 


Leu 


Arg 


Trp 


Gly 


Val 


Phe 


Asp Glu 








165 










170 










175 


Tyr 


Asn 


Asn Asp 


Lys 


Pro 


Phe 


Tyr 


He 


Asn 


Gly 


Gin 


Asn 


Gin 


He Lys 






180 










185 










190 


Val 


Thr 


Arg Cys 


Ser 


Ser 


Asp 


He 


Thr 


Gly 


He 


Phe 


Yal 


Cys 


Glu Lys 






195 








200 










205 




Gly 


Pro 


Cys Pro 


Gin 


Glu 


Asn 


Cys 


He 


He 


Ser 


Lys 


Leu 


Phe 


Lys Glu 




210 








215 










220 






Gly 


Cys 


Thr Phe 


He 


Tyr 


Asn 


Ser 


Thr 


Gin 


Asn 


Ala 


Thr 


Ala 


Ser He 


225 








230 










235 








240 


Met 


Phe 


Met Gin 


Ser 


Leu 


Ser 


Ser 


Val 


Val 


Glu 


Phe 


Cys 


Asn 


Ala Ser 








245 










250 








255 


Thr 


His 


Asn Gin 


Glu 


Ala 


Pro 


Asn 


Leu 


Gin 


Asn 


Gin 


Met 


Cys 


Ser Leu 






260 










265 










270 




Arg 


Ser 


Ala Trp 


Asp 


Val 


He 


Thr 


Asp 


Ser 


Ala 


Asp 


Phe 


His 


His Ser 






275 








280 










285 






Phe 


Pro 


Met Asn 


Gly 


Thr 


Glu 


Leu 


Pro 


Pro 


Pro 


Pro 


Thr 


Phe 


Ser Leu 




290 








295 










300 








Val 


Glu 


Ala Gly 


Asp 


Lys 


Val 


Val 


Cys 


Leu 


Val 


Leu 


Asp 


Val 


Ser Ser 


305 








310 










315 






320 



ttacaactga agatcatgct atatttcata 3360 
tcttgctatt ttgttatata tattacagat 3420 
ttttcactgt aagaggtaac ctttaacaat 3480 
ttttatgaca aaggtctatt gaatttattt 3540 
ctttttaagt tattgccttg gttattatgg 3600 
ctaaggaaga aaagatgtta ttctgagttt 3660 
attcaattaa accaaagaag aggtcagcag 3720 
ggctctgttt tttggttaaa taagagtctt 3780 
aagggcaggg gaagggggat atagaggtcc 3840 
attttactcc ttcctcttat ttttttaaaa 3900 
ttaattaaaa acataaaaaa a 3951 
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Lys Met Ala 

Phe Tyr Leu 

Ser Phe Asp 
355 

Ser Asn Asp 

370 
Ser Ala Lys 
385 

Glu Val Val 



Glu Ala 

325 
Met Gin 
340 

Ser Lys 
Asp Arg 
Thr Asp 



Leu Val Thr 



Val Leu Ser 
435 

Ala Ala Pro 

450 
Phe Phe Val 
465 

Ser Arg He 

Leu Glu Ser 

Thr Val Thr 
515 

Thr Trp Gin 

530 
Gly Arg Lys 
545 

Thr Ala Ser 



Glu Lys 
405 
Ser Gly 
420 

Ser Gly 

Asn Leu 

Pro Asp 

Ser Ser 
485 
Thr Gly 
500 

Val Asp 



Asp Arg 

He Val 

Gly Glu 

Lys Leu 
375 
lie Ser 
390 

Leu Asn 



Leu Leu Gin 
330 

Glu He His 

345 
He Arg Ala 
360 

Leu Val Ser 
He Cys Ser 



Leu Gin 
Thr Phe 
Gin Leu 



Asp Asp 

Ser Thr 

Glu Glu 
455 
He Ser 
470 

Gly Thr 



Glu Asn 
Asn Thr 



Ala Ser 
Tyr Tyr 



Leu Trp 
565 

Tyr Thr Leu Asn Asn 
580 

Arg Ala 



Val Thr Ser 
595 
Ala Phe Val 

610 
Tyr Ala Asn 
625 

Thr Ala Thr 

Leu Asp Asp 

Ser Arg Tyr 
675 

Val His Val 

690 
Pro Gly Ser 
705 

He Gin Met 



Gly Pro 
535 
Thr Asn 
550 

He Pro 
Thr His 
Ser Asn 



Gly Lys Ala 
410 

Lys Leu Leu 

425 
He His Ser 
440 

Leu Ser Arg 

Asn Ser Asn 

Gly Asp He 
490 

Val Lys Pro 

505 
Val Gly Asn 
520 

Pro Glu He 



Tyr Leu 
380 
Gly Leu 
395 

Tyr Gly 

Gly Asn 

He Ala 

Leu Thr 
460 
Ser Met 
475 

Phe Gin 



Gin Ala Ala Glu 

335 

Val Gly He Ala 
350 

His Gin He Asn 
365 

Pro Thr Thr Val 



His His 
Asp Thr 



Glu Arg 

Val Lys 

Val Glu 
645 
Gly Ala 
660 

Phe Phe 



Asn Phe He 

Gly Thr Ala 

570 

His Ser Leu 

585 
Ser Ala Val 
600 

Leu His Phe 



Arg Lys Trp 



Asn His 

His Ala 

Asn Ala 
725 
Gly Phe 
740 

Pro Ala 



Asp Ser 
615 

Gin Gly Phe Tyr Pro 
630 

Pro Glu Thr Gly Asp 
650 

Gly 7U.a Asp Val He 

665 

Ser Phe Ala Ala Asn 
680 

Ser He Ser 



He Leu 
540 
Thr Asn 
555 

Lys Pro 

Gin Ala 

Pro Pro 

Pro His 
620 
He Leu 
635 

Pro Val 



Lys Lys Gly Phe 
400 

Ser Val Met He 
415 

Cys Leu Pro Thr 
430 

Leu Gly Ser Ser 
445 

Gly Gly Leu Lys 

He Asp Ala Phe 
480 

Gin His He Gin 
495 

Gin Leu Lys Asn 
510 

Met Phe Leu Val 
525 

Phe Asp Pro Asp 

Leu Thr Phe Arg 
560 

Gly His Trp Thr 
575 

Leu Lys Val Thr 
590 

Ala Thr Val Glu 
605 

Pro Val Met He 



Leu Gly Val 

755 

He He Asp Leu Glu 
770 



Ser Pro 
695 
Met Tyr 
710 

Pro Arg 

Ser Arg 

Gly Pro 

Ala Val 
775 



Thr Pro 
700 

Val Pro Gly Tyr Thr 

715 

Gly Arg 



Asn Ala Thr Val 
640 

Thr Leu Arg Leu 
655 

Lys Asn Asp Gly He Tyr 
670 

Gly Arg Tyr Ser Leu Lys 
685 

Ala His Ser He 



Lys Ser Val 
730 
Val Ser Ser 

745 
His Pro Asp 
760 

Lys Val Glu 



Gly Gly 

Val Phe 

Glu Glu 
780 



Ala Asn Gly Asn 
720 

Asn Glu Glu Glu 
735 

Ser Phe Ser Val 
750 

Pro Pro Cys Lys 
765 

Leu Thr Leu Ser 
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Trp Thr 


Ala Pro 


Gly Glu Asp 


Phe 


Asp 


Gin Gly 


Gin Ala Thr Ser Tyr 


785 




790 






795 


800 


Glu He 


Arg Met 


Ser Lys Ser 


Leu 


Gin 


Asn He 


Gin Asp Asp Phe Asn 






805 






810 


815 


Asn Ala 


He Leu 


Val Asn Thr 


Ser 


Lys 


Arg Asn 


Pro Gin Gin Ala Gly 




820 






825 




830 


He Arg 


Glu He 


Phe Thr Phe 


Ser 


Pro 


Gin He 


Ser Thr Asn Gly Pro 




835 




840 






845 


Glu His 


Gin Pro 


Asn Gly Glu 


Thr 


His 


Glu Ser 


His Arg He Tyr Val 


850 




855 








860 


Ala He 


Arg Ala 


Met Asp Arg 


Asn 


Ser 


Leu Gin 


Ser Ala Val Ser Asn 


865 




870 






875 


880 


He Ala 


Gin Ala 


Pro Leu Phe 


He 


Pro 


Pro Asn 


Ser Asp Pro Val Pro 






885 






890 


895 


Ala Arg 


Asp Tyr 


Leu He Leu 


Lys 


Gly 


Val Leu 


Thr Ala Met Gly Leu 




900 






905 




910 


He Gly 


He He 


Cys Leu He 


He 


Val 


Val Thr 


His His Thr Leu Ser 




915 




920 






925 


Arg Lys 


Lys Arg 


Ala Asp Lys 


Lys 


Glu 


Asn Gly 


Thr Lys Leu Leu 


930 




935 








940 



<210> 162 
<211> 498 
<212> DNA 

<213> Homo sapiens 
<400> 162 

tggagaacca cgtggacagc accatgaaca tgttgggcgg gggaggcagt gctggccgga 60 
agcccctcaa gtcgggtatg aaggagctgg ccgtgttccg ggagaaggtc actgagcagc 120 
accggcagat gggcaagggt ggcaagcatc accttggcct ggaggagccc aagaagctgc 180 
gaccaccccc tgccaggact ccctgccaac aggaactgga ccaggtcctg gagcggatct 240 
ccaccatgcg ccttccggat gagcggggcc ctctggagca cctctactcc ctgcacatcc 300 
ccaactgtga caagcatggc ctgtacaacc tcaaacagtg gcaagatgtc tctgaacggg 360 
cagcgtgggg agtgctggtg tgtgaacccc aacaccggga agctgatcca gggagccccc 420 
accatccggg gggaccccga gtgtcatctc ttctacaatg agcagcagga ggctcgcggg 480 
gtgcacaccc cagcggat 498 

<210> 163 

<211> 1128 

<212> DNA 

<213> Homo sapiens 

<400> 163 

gccacctggc cctcctgatc gacgacacac gcacttgaaa cttgttctca gggtgtgtgg 60 

aatcaacttt ccggaagcaa ccagcccacc agaggaggtc ccgagcgcga gcggagacga 120 

tgcagcggag actggttcag cagtggagcg tcgcggtgtt cctgctgagc tacgcggtgc 180 

cctcctgcgg gcgctcggtg gagggtctca gccgccgcct caaaagagct gtgtctgaac 240 

atcagctcct ccatgacaag gggaagtcca tccaagattt acggcgacga ttcttccttc 300 

accatctgat cgcagaaatc cacacagctg aaatcagagc tacctcggag gtgtccccta 360 

actccaagcc ctctcccaac acaaagaacc accccgtccg atttgggtct gatgatgagg 420 

gcagatacct aactcaggaa actaacaagg tggagacgta caaagagcag ccgctcaaga 480 

cacctgggaa gaaaaagaaa ggcaagcccg ggaaacgcaa ggagcaggaa aagaaaaaac 540 

ggcgaactcg ctctgcctgg ttagactctg gagtgactgg gagtgggcta gaaggggacc 600 

acctgtctga cacctccaca acgtcgctgg agctcgattc acggaggcat tgaaattttc 660 

agcagagacc ttccaaggac atattgcagg attctgtaat agtgaacata tggaaagtat 720 

tagaaatatt tattgtctgt aaatactgta aatgcattgg aataaaactg tctcccccat 780 

tgctctatga aactgcacat tggtcattgt gaatattttt ttttttgcca aggctaatcc 840 
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tgtatcttgg t'gSgcSa' tttc^ala"? S^gSaJa JL'J^^" attttgtaaa 900 
tatcaagtat gttgataaat gacacaatga TgtllcTcta ItllT^'lt ^^^^^^taca 960 
aatgcctaaa tataattatc caaattgatt t^oo^^^^^ t^ttgtggtt gattttaatg 1020 
ttaaatttgt aaagaatgtc taataaaaU tlltHlTtl TclStT' '''' 
<210> 164 ^^^^ 
<211> 1310 
<212> DMA 
<213> Homo sapiens 

<400> 164 

??ScX' t^oL^Sc' iZZltl lir'^^'' ^^^^^^.cgg 60 

gagacgtgta aacacfctac ttT^ttgl tTcttlUlt lllT'ttr. ^^^^^-^t?? 1^20 
ttatttcaga ggaagcgcct ctgatttg?? tct?uJ?cr o^f^^!"!* attttcgcta 180 
tgtggtttgg agaaagcaca gttggagSg ccaattact^ ctttttgctc tttctggctg 240 
ggagacgatg cagcggagac IggltcTgll gtgglacall ^^'^^^^'^'^^ ^agcgcgagc 300 
cgcggtgccc tcctgcgggc gc^cggtSa ggg?c?S^^ gcggtgttcc tgctgagcta 360 
gtctgaacat cagctcctcc ataBr-t^^r.Z ^^^^2 cgccgcctca aaagagctgt 420 

cttccttcac caLtgatcS SaaatS lllllTtalt ^f'^^^- ggcgacgaS 48o 
gtcccctaac tccaagccc? ctcccaaoac aaaaaacca^ f "f^cta cctcggaggt 540 
tgatgagggc agatacctaa ctcaggaaac Sa!aaaa^ '^'^^^^ccgat ttgggtctga 600 
gctcaagaca cctgggaaga aaaaglaagg ^aagcccSa L'f '"''""^ aagagcagcc 660 
gaaaaaacgg cgaactcgct ctgcctaatt tnl^^^^^^ aaacgcaagg agcaggaaaa 720 
aggggaccac ctgtctgaca ccSaac Staaa: 3tgactggga gtgggltaga 780 
aaattttcag cagagacott ccaaggacS Tttaclallf ctcgattcac ggaggcattg 840 
gaaagtatta gaaatattta ttgtctSJa lltrtT ^^^^^taatag tgaacatatg 900 
tcccccattg ctctatgaaa ctJcacSta aJf^f tgcattggaa taaaaotgtc 960 
S^T'^T "-"-"^t ciLtttac,' a^'aX'a'?: ttT^'tl' ""^-'ag 1020 
tttgtaaatg tatcttggtg ctgctgaatt tctata;t^^ ^^^^^'^^"g atgtatttat 1080 
gatatacata tcaagtatgt tgataaa^«» ! " ttgtaacata atgcacttta 1140 

ttttaatgaa tgccLaaS taa^taJcIa ITtHfttf ^^j?.^*^^^" ttgtggttga 12oS 
acagtatttt aaatttgtaa agaatglc^^ ^tla^l^S TcllfttTc ''''^'''''^ ° 



<210> 165 
<211> 177 
<212> PRT 

<213> Homo sapiens 



<400> 165 

*t ^ ^, ^ ^ 

M. v.. P„ 3„ oy. ^ - 

- ... v.. ... hL - 

n, 0X„ I.. ^, ^, ^ « 

iUa Glu He His Thr Ai^ tIm ti . 

„ M. Glu u, Si, ai, Tte s„ ^„ 

^ W p„ 3„ P„ ^. ^ ^ eo 

3» »^ - ^„ » 

T.. .J,. ^„ ^ ^ no ^ 

%s ^, ^„ ^ ^„ 



wo 02/00174 



81 



PCT/USOl/21065 



130 135 140 

Ser Ala Trp Leu Asp Ser Gly Val Thr Gly Ser Gly Leu Glu Gly Asp 
145 150 155 160 

His Leu Ser Asp Thr Ser Thr Thr Ser Leu Glu Leu Asp Ser Arg Arg 
165 170 175 

His 



<210> 166 
<211> 177 
<212> PRT 

<213> Homo sapiens 



<400> 166 



Met 


Gin 


Arg Arg Leu Val 


Gin Gin Trp Ser 


Val Ala Val 


Phe Leu Leu 


1 




5 


10 




15 


Ser 


Tyr 


Ala Val Pro Ser 


Cys Gly Arg Ser 


Val Glu Gly 


Leu Ser Arg 






20 


25 




30 


Arg 


Leu 


Lys Arg Ala Val 


Ser Glu His Gin 


Leu Leu His 


Asp Lys Gly 






35 


40 


45 




Lys 


Ser 


lie Gin Asp Leu 


Arg Arg Arg Phe 


Phe Leu His 


His Leu He 




50 




55 


60 




Ala 


Glu 


He His Thr Ala 


Glu He Arg Ala 


Thr Ser Glu 


Val Ser Pro 


65 




70 




75 


80 


Asn 


Ser 


Lys Pro Ser Pro 


Asn Thr Lys Asn 


His Pro Val 


Arg Phe Gly 






85 


90 




95 


Ser 


Asp 


Asp Glu Gly Arg 


Tyr Leu Thr Gin 


Glu Thr Asn 


Lys Val Glu 






100 


105 




110 


Thr 


Tyr 


Lys Glu Gin Pro 


Leu Lys Thr Pro 


Gly Lys Lys 


Lys Lys Gly 






115 


120 


125 




Lys 


Pro 


Gly Lys Arg Lys 


Glu Gin Glu Lys 


Lys Lys Arg 


Arg Thr Arg 




130 




135 


140 




Ser 


Ala 


Tirp Leu Asp Ser 


Gly Val Thr Gly 


Ser Gly Leu 


Glu Gly Asp 


145 




150 




155 


160 


His 


Leu 


Ser Asp Thr Ser 


Thr Thr Ser Leu 


Glu Leu Asp 


Ser Arg Arg 






165 


170 




175 



His 



<210> 167 

<211> 3362 

<212> DMA 

<213> Homo sapiens 

<400> 167 

cacaatgtat gcagcaggct cagtgtgagt 

aaggagcatt gcaggtccta tttgcaacct 

ttcagaactc ccattcctgg gagctggagt 

gctcattgca attaatcctc aggtacctga 

aatgataact gaagcttcat tttacctatt 

aaatataaag attttaatac ctgccacatg 

agaatcatat gaaaaggcaa atgtcatagt 

tccatacacc ctacaataca gagggtgtgg 

taatttccta ctgaatgata acttaacagc 

ccatgaatgg gcccacctcc gttggggtgt 

ctacataaat gggcaaaatc aaattaaagt 



gaactggagg cttctctaca acatgaccca 60 
gaagtttgtg actctcctgg ttgccttaag 120 
acagcttcaa gacaatgggt ataatggatt 180 
gaatcagaac ctcatctcaa acattaagga 240 
taatgctacc aagagaagag tatttttcag 300 
gaaagctaat aataacagca aaataaaaca 360 
gactgactgg tatggggcac atggagatga 420 
aaaagaggga aaatacattc atttcacacc 480 
tggctacgga tcacgaggcc gagtgtttgt 540 
gttcgatgag tataacaatg acaaaccttt 600 
gacaaggtgt tcatctgaca tcacaggcat 660 
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lUZTtfo llTtlllll T^^ltllTo ™2 ^"-^^^^ a.ctttttaa 7.0 
gcaaagttta tcttctgtgg ttgaat?t?g tStn^S^f actgcatcaa taatgttcat 780 
aaacctacag aaccagatgt gcagcctoa? aaSl^^^^ acccacaacc aagaagcacc 840 
tgactttcac cacag^tt?c ccaCcgg glfttZllt lllft^'T "^-^^-tgc 900 
gcttgtagag gctggtgaca aagtggtc?g ??SoSct^ nttT^'''''' ccacattctc 960 
taaS^^^ agactccttc aactacaaca agcSS^ t'tSS^^ ^^aagatggc 1020 
tgaaattcat accttcgtgg gcattgccag tSacaan tgcagattgt 1080 

gctacaccaa attaacagca atgatgatcg aajattac^a nttf^T?^ tcagagccca 1140 
tgtatcaget aaaacagaca tcagcatttg ttclll^^lf gtttcatatc tgcccaccac 1200 
tgaaaaactg aatggaaaag cttatggcJc JgtSa TttT^'^^ "gaggtggt 1260 
taagcttctt ggcaattgct tacccactgt gSSaSot lltf^^''''^ gcggagatga 1320 
tgccctgggt tcatctgcag ccccaaatS gSSS! ^^"^f^^^ ttcactccat 1380 
aaagttcttt gttccagata tatcaaactc c!a?aacSa atf^^*!^ caggaggttt 1440 
ttcctctgga actggagaca ttttccagca acata?tr^n attgatgctt tcagtagaat 1500 
tgtcaaacct caccatcaat tgaaaaa?ac aSaa^Jo^^ cttgaaagta caggtgaaaa 1560 
cactatgttt ctagttacgt. ggcaggcSg ZlalllJ ItlttlTf ^^^^-^^^'3^ 1^20 
tgatggacga aaatactaca caaataattt ta?cae;^»^ gagattatat tatttgatcc 1680 
tctttggatt ccaggaacag ctaagcctgg gcacSaaS .^f''^^^^ ggacagctag 1740 
. ccatgcaaaa ttattgacct ggaagctg^I aaaSf! tacaccctga tgtgtttcca 1800 
acagcacctg gagaagactt JgatLg^c cagStaJaa S"^"" cctatcttgg i860 
aaaagtctac agaatatcca agatgac??t aacaata^^f f^^!^^^^^ aagaatgagt 1920 
cgaaatcctc agcaagctgg citcagggag aJamS ""^^^aaa tacatcaaag 1980 
aatggacctg aacatcagcc aaatggagaa acjSJoaS aatttccacg 2040 

atacgagcaa tggataggaa ctcc??acag tSgc^rtat ttt ft^"" ttatgttgca 2100 " 
• ctgtttattc cccccaattc tgatcctata Itln^l^ ctaacattgc ccaggcgcct 2160 
gttttaacag caatgggttt gataggafte TttglTtl t%ataS.'%' ^^^gaaa'gga 2220 
actttaagca ggaaaaagaq acrcaaar;,^^ -ttcgcctta ttatagttgt gacacatcat 2280 
aatatccaaa gtgtcttcc? tctt^gata? aaaaSS ^'T.'^^^' attataaata 2340 
actaacaaag tcaaattaac atcaaaJc^i tllS^tt gccttcgact acaaaaacat 2400 
cagataagat ttttacatgg tagatcaaca aattofff^v '^^"a^Slttt ttgtacaata 2460 
ttacactttg gctatgaaca aaLSaaaa attlftaltt ^^'^f^^^^ tagaaaaccc 2520 
aaagggaagg gtaaagtcgg accagtgtca aggaaJS^^ a^'^^.^!^* ctttaaaggc 2580 
tagccccaag cagagaaaag gagggtagg? c?acaSJ= ^ ggtggaaaaa 2640 

atttagttao tttgattaat ttttctt?S tccttaJS^ actgtctgtg tgaagcaatc 2700 
tacaactgaa gatcatgcta tatttcatat at^.! ^ tgcagaacag gttgcttgtt 2760 
cttgctattt tgttatltat attacaS?J Jaftnf''''? ""^I'^^^^^'S ctctttacct 2820 
tttcactgta agaggtaacc tttlTclTa t^gSaSac ^c^ltVft ^^^^atcttt 2880 
tttatgacaa aggtctattg aatttattta tt?^f I f f ftttgtctct tcataccggt 2940 
tttctaagtt attgccttgg ttattata^! !f ^ tctactccca tcaaagcagc 3000 

taaggaagaa aagatgtt^? tc?gaS??^ l^llT.ll S"^'^ atgcct'taa'c 3060 
ttcaattaaa ccaaagaaga ggtcaacaaa a^^^f f ^^^^tgaaca tatagtttta 3120 
gctctgtttt ttggttaaat SgaSct?? aScctS'^ -ctttggaaa tgattagctg 3180 
agggcagggg aagggggata tagaggtcJc iToaTalltl f^^^^f^ag ttaottacca 3240 
ttttactect tcctcttatt tttVLa^^ a^^^^^^^ llll 

<210> 168 ^^^^ 

<211> 2784 

<212> DMA 

<213> Homo sapiens 



<4 00> 168 



^SSSI lllllltlll ZllT'tZ T^'°"S '-^^"ot, eo 

- i =S -sd So° 

,«=ctc. i^^^ t„ ^^^^ 
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taccaagaga agagtatttt tcagaaatat aaagatttta atacctgcca catggaaagc 360 
taataataac agcaaaataa aacaagaatc atatgaaaag gcaaatgtca tagtgactga 420 
ctggtatggg gcacatggag atgatccata caccctacaa tacagagggt gtggaaaaga 480 
gggaaaatac attcatttca cacctaattt cctactgaat ga'taacttaa cagctggcta 540 
cggatcacga ggccgagtgt ttgtccatga atgggcccac ctecgttggg gtgtgttcga 600 
tgagtataac aatgacaaac ctttctacat aaatgggcaa aatcaaatta aagtgacaag 660 
gtgttcatct gacatcacag gcatttttgt gtgtgaaaaa ggtccttgcc cccaagaaaa 720 
ctgtattatt agtaagcttt ttaaagaagg atgcaccttt atctacaata gcacccaaaa 780 
tgcaactgca tcaataatgt tcatgcaaag tttatcttct gtggttgaat tttgtaatgc 840 
aagtacccac aaccaagaag caccaaacct acagaaccag atgtgcagcc tcagaagtgc 900 
atgggatgta atcacagact ctgctgactt tcaccacagc tttcccatga acgggactga 960 
gcttccacct cctcccacat tctcgcttgt agaggctggt gacaaagtgg tctgtttagt 1020 
gctggatgtg tccagcaaga tggcagaggc tgacagactc cttcaactac aacaagccgc 1080 
agaattttat ttgatgcaga ttgttgaaat tcataccttc gtgggcattg ccagtttcga 1140 
cagcaaagga gagatcagag cccagctaca ccaaattaac agcaatgatg atcgaaagtt 1200 
gctggtttca tatctgccca ccactgtatc agctaaaaca gacatcagca tttgttcagg 1260 
gcttaagaaa ggatttgagg tggttgaaaa actgaatgga aaagcttatg gctctgtgat 1320 
gatattagtg accagcggag atgataagct tcttggcaat tgcttaccca ctgtgctcag 1380 
cagtggttca acaattcact ccattgccct gggttcatct gcagccccaa atctggagga 1440 
attatcacgt cttacaggag gtttaaagtt ctttgttcca gatatatcaa actccaatag 1500 
catgattgat gctttcagta gaatttcctc tggaactgga gacattttcc agcaacatat 1560 
tcagcttgaa agtacaggtg aaaatgtcaa acctcaccat caattgaaaa acacagtgac 1620 
tgtggataat actgtgggca acgacactat gtttctagtt acgtggcagg ccagtggtcc 1680 
tcctgagatt atattatttg atcctgatgg acgaaaatac tacacaaata attttatcac 1740 
caatctaact tttcggacag ctagtctttg gattccagga acagctaagc ctgggcactg 1800 
gacttacacc ctgaacaata cccatcattc tctgcaagcc ctgaaagtga cagtgacctc 1860 
tcgcgcctcc aactcagctg tgcccccagc cactgtggaa gcctttgtgg aaagagacag 1920 
cctccatttt cctcatcctg tgatgattta tgccaatgtg aaacagggat tttatcccat 1980 
tcttaatgcc actgtcactg ccacagttga gccagagact ggagatcctg ttacgctgag 2040 
actccttgat gatggagcag gtgctgatgt tataaaaaat gatggaattt actcgaggta 2100 
ttttttctcc tttgctgcaa atggtagata tagcttgaaa gtgcatgtca atcactctcc 2160 
cagcataagc accccagccc actctattcc agggagtcat gctatgtatg taccaggtta 2220 
cacagcaaac ggtaatattc agatgaatgc tccaaggaaa tcagtaggca gaaatgagga 2280 
ggagcgaaag tggggcttta gccgagtcag ctcaggaggc tccttttcag tgctgggagt 2340 
tccagctggc ccccaccctg atgtgtttcc accatgcaaa attattgacc tggaagctgt 2400 
aaatagaaga ggaattgacc ctatcttgga cagcacctgg agaagacttt gatcagggcc 2460 
aggctacaag ctatgaaata agaatgagta aaagtctaca gaatatccaa gatgacttta 2520 
acaatgctat tttagtaaat acatcaaagc gaaatcctca gcaagctggc atcagggaga 2580 
tatttacgtt ctcaccccaa atttccacga atggacctga acatcagcca aatggagaaa 264 0 
cacatgaaag ccacagaatt tatgttgcaa tacgagcaat ggataggaac tccttacagt 2700 
ctgctgtatc taacattgcc caggcgcctc tgtttattcc ccccaattct gatcctgtac 2760 
ctgccagaga ttatcttata ttga 2784 

<210> 169 
<211> 592 
<212> PRT 

<213> Homo sapiens 
<400> 169 

Met Thr Gin Arg Ser lie Ala Gly Pro lie Cys Asn Leu Lys Phe Val 

15 10 15 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Gly Ala Gly 

20 25 30 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu lie Ala He Asn 

35 40 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn He Lys Glu Met 

50 55 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arg Val 
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65 

Phe Phe 

Asn Asn 

Val Thr 

Tyr Arg 
130 
Phe Leu 
145 

Val Phe 



Arg Asn 

Ser Lys 
100 
Asp Trp 
115 

Gly Cys 
Leu Asn 
Val His 



Tyr Asn 

Val Thr 

Gly Pro 
210 
Gly Cys 
225 

Met Phe 
Thr His 
Arg Ser 



Phe Pro 
290 
Val Glu 
305 

Lys Met 

Phe Tyr 

Ser Phe 

Ser Asn 
370 
Ser Ala 
385 

Glu Val 



Asn Asp 
180 
Arg Cys 
195 

Cys Pro 

Thr Phe 

Met Gin 

Asn Gin 
260 
Ala Trp 
275 

Met Asn 



70 

lie Lys 
85 

lie Lys 

Tyr Gly 

Gly Lys 

Asp Asn 
150 
Glu Trp 
165 

Iiys Pro 



He Leu 
Gin Glu 



75 

He Pro Ala 
90 

Ser Tyr Glu 
105 

Gly Asp Asp 



Ala His 

120 

Glu Gly Lys Tyr lie 
135 

Leu Thr 



80 

Thr Trp Lys Ala Asn 

95 

Lys Ala Asn Val He 
110 

Thr Leu Gin 
Thr Pro Asn 



Ala His 
Phe Tyr 



Ser Ser 

Gin Glu 

He Tyr 
230 
Ser Leu 
245 

Glu Ala 



Ala Gly Tyr 
155 

Leu Arg Trp 

170 
He Asn Gly 
185 

Thr Gly He 



Pro Tyr 
125 
His Phe 
140 

Gly Ser 



Gly Val 
Gin Asn 



Asp He 
200 

Asn Cys He He Ser 
215 

Asn Ser 



Ser Ser 
Pro Asn 



Asp Val 

Gly Thr 



Ala Gly 

Ala Glu 

Leu Met 
340 
Asp Ser 
355 

Asp Asp 



Asp Lys 
310 
Ala Asp 
325 

Gin He 
Lys Gly 
Arg Lys 



He Thr 

280 
Glu Leu 
295 

Val Val 



Thr Gin Asn 
235 

Val Val Glu 

250 
Leu Gin Asn 
265 

Asp Ser Ala 
Pro Pro Pro 



Phe Val 
205 
Lys Leu 
220 

Ala Thr 



Arg Gly Arg 
160 

Phe Asp Glu 

175 
Gin He Lys 
190 

Cys Glu Lys 
Phe Lys Glu 



Ala Ser He 

Phe Cys Asn Ala Ser 
255 

Gin Met Cys Ser Leu 
270 

His His Ser 
Phe Ser Leu 



Lys Thr 
Val Glu 



Leu Val 

Val Leu 

Ala Ala 
450 
Phe Phe 

465 

Ser Arg 
Leu Glu 
Thr Val 
Thr Trp 



Thr Ser 
420 
Ser Ser 
435 

Pro Asn 

Val Pro 

He Ser 

Ser Thr 
500 
Thr Val 
515 

Gin Ala 



Asp He 
390 
Lys Leu 
405 

Gly Asp 

Gly Ser 

Leu Glu 

Asp He 
470 
Ser Gly 
485 

Gly Glu 



Arg Leu 

Val Glu 

Glu He 
360 
Leu Leu 
375 

Ser He 



Cys Leu Val 
315 

Leu Gin Leu 

330 
He His Thr 
345 

Arg Ala Gin 



Asp Phe 
285 
Pro Thr 
300 

Leu Asp 



Gin Gin 
Phe Val 



Leu His 
365 

Val Ser Tyr Leu Pro 
380 

Leu Lys 



Val Ser Ser 
320 

Ala Ala Glu 

335 
Gly He Ala 
350 

Gin He Asn 

Thr Thr Val 



Asn Gly 

Asp Lys 

Thr He 
440 
Glu Leu 
455 

Ser Asn 
Thr Gly 
Asn Val 



Cys Ser Gly 

395 

Lys Ala Tyr 

410 
Leu Leu Gly 
425 

His Ser He 

Ser Arg Leu 

Ser Asn Ser 
475 

Asp He Phe 

490 
Lys Pro His 
505 

Gly Asn Asp 



Lys Gly Phe 
400 

Gly Ser Val Met He 
415 

Asn Cys Leu Pro Thr 
430 

Gly Ser Ser 
Gly Leu Lys 



Ala Leu 
445 
Thr Gly 
460 
Met He 



Asp Asn Thr Val 
520 

Ser Gly Pro Pro Glu He He 



Gin Gin 

His Gin 

Thr Met 
525 
Leu Phe 



Asp Ala Phe 
480 

His He Gin 

495 
Leu Lys Asn 
510 

Phe Leu Val 



Asp Pro Asp 
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530 


535 






540 




Gly Arg Lys 


Tyr Tyr Thr Asn 


Asn Phe 


lie 


Thr Asn Leu 


Thr Phe Arg 


545 


.550 






555 


560 


Thr Ala Ser 


Leu Trp lie Pro 


Gly Thr 


Ala 


Lys Pro Gly 


His Trp Thr 




565 




570 




575 


Tyr Thr Leu 


Met Cys Phe His 


His Ala 


Lys 


Leu Leu Thr 


Trp Lys Leu 




580 


585 






590 



<210> 170 
<211> 791 
<212> PRT 

<213> Homo sapiens 



<400> 170 



Met 


Thr 


Gin 


Arg 


Ser lie Ala 

h'v^ j.^^ fx^a 


Gly Pro 


He 


Cvs Asn L^u 


Lys 


Phe Val 


1 








5 






10 

X \J 






15 


Thr 


Leu 


Leu 


Val 


Ala Ti^ii S^T* 
rxxci xjcu tDxsx. 


Ser 


Glu 


Leu 


PT*r» PHo T.eii 

JTXW XriiC Xl\3Lt 


Gly Ala Gly 








20 






25 






30 




Val 


Gin 


Leu 


Gin 


XlO^ fXOll \D^jf 


Tyr Asn 


Gly 


XlCU XlCU xxc 


Ala 


He Asn 






35 






40 












Pro 


Gin 






V3J.U Aoll OXXl 


Asn 


Leu 


Tl #» 


OCX Aoll XX€ 


Lys Glu Met 




50 














fiO 






lie 


Thr 


Glu 




Sfiy PVi» T\7T* 
ljksa. iriic ^j-^ 


Leu 


Phe 




riXd liix Jjyo 


Arg Arg Val 


65 








70 












80 


Phe 


Phe 


Arg 


Asn 


Tlo T.\7<a 

XXC JJjfO xxc 


Leu 


He 


irx \J 


xlXd XXIX J.xp 


Lys Ala Asn 
















QO 






95 


Asn 


Asn 


Ser 




Tl^ LiV^ t^l n 


Glu 


Ser 




(Zl 11 T.\7Q Al a 
wxu uyss AXd 


Asn 


Val He 








1 no 






105 






110 




Val 


Thr 






TvT* m \7 Al A 
X j^x wx fxxa 


His 


Gly 




Dv/^ Tttt* 
/lop Jrx<J <Lyx 


Thr 


Leu Gin 






J. 






120 












Tyr 


Arg 


Gly 


Cys 


Gly Lys Glu 


Gly Lys 


Tvr 


He His Phe 


Thr 


Pro Asn 




130 






135 








140 






Phe 


Leu 


Leu 


Asn 


Asp Asn Leu 


Thr 


Ala 


Gly 


Tyr Gly Ser 


Arg Gly Arg 


145 








150 








155 




160 


Val 


Phe 


Val 


His 


Glu Trp Ala 


His 


Leu 


Arg 


Trp Gly Val 


Phe Asp Glu 










165 






170 






175 


Tyr 


Asn 


Asn 


Asp 


Lys Pro Phe 


Tyr 


He 


Asn 


Gly Gin Asn 


Gin 


He Lys 








180 






185 






190 




Val 


Thr 


Arg 


Cys 


Ser Ser Asp 


He 


Thr 


Gly 


He Phe Val 


Cys 


Glu Lys 






195 






200 






205 






Gly 


Pro 


Cys 


Pro 


Gin Glu Asn 


Cys 


He 


He 


Ser Lys Leu 


Phe 


Lys Glu 




210 






215 








220 






Gly 


Cys 


Thr 


Phe 


lie Tyr Asn 


Ser 


Thr 


Gin 


Asn Ala Thr 


Ala 


Ser He 


225 








230 








235 




240 


Met 


Phe 


Met 


Gin 


Ser Leu Ser 


Ser 


Val 


Val 


Glu Phe Cys 


Asn 


Ala Ser 










245 






250 






255 


Thr 


His 


Asn 


Gin 


Glu Ala Pro 


Asn 


Leu 


Gin 


Asn Gin Met 


Cys 


Ser Leu 








260 






265 






270 




Arg 


Ser 


Ala 


Trp 


Asp Val lie 


Thr Asp 


Ser 


Ala Asp Phe 


His 


His Ser 






275 






280 






285 






Phe 


Pro 


Met 


Asn 


Gly Thr Glu 


Leu 


Pro 


Pro 


Pro Pro Thr 


Phe 


Ser Leu 




290 






295 








300 . 






Val 


Glu 


Ala 


Gly 


Asp Lys Val 


Val 


Cys 


Leu 


Val Leu Asp 


Val 


Ser Ser 


305 








310 








315 




320 


Lys 


Met 


Ala 


Glu 


Ala Asp Arg 


Leu 


Leu 


Gin 


Leu Gin Gin 


Ala 


Ala Glu 










325 






330 






335 


Phe 


Tyr 


Leu 


Met 


Gin lie Val 


Glu 


He 


His 


Thr Phe Val 


Gly He Ala 
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Ser Phe 

Ser Asn 
370 
Ser Ala 
385 

Glu Val 

Leu Val 

Val Leu 

Ala Ala 
450 
Phe Phe 
465 

Ser Arg 

Leu Glu 

Thr Val 

Thr Trp 
530 
Gly Arg 
545 

Thr Ala 

Tyr Thr 

Val Thr 

Ala Phe 
610 
Tyr Ala 
625 

Thr Ala 

Leu Asp 

Ser Arg 

Val His 
690 
Pro Gly 
705 

lie Gin 

Arg Lys 

Leu Gly 

He lie 
770 
Asp Ser 
785 



340 

Asp Ser Lys Gly Glu He Arg Ala 
360 

Asp Asp Arg Lys Leu Leu Val Ser 
375 

Lys Thr Asp He Ser He Cys Ser 
390 

Val Glu Lys Leu Asn Gly Lys Ala 
^05 

!on Leu 
425 

Ser Ser Gly ser Thr He His Ser 
440 

Pro Asn Leu Glu Glu leu Ser Arg 
455 

Val Pro Asp He Ser Asn Ser Asn 
470 

lie Ser Ser Gly Thr Gly Asp He 

^85 AQQ 

Ser Thr Gly Glu Asn Val Lys Pro 

505 

Thr Val Asp Asn Thr Val Gly Asn 
^, 520 
Gin Ala Ser Gly Pro Pro Glu He 

535 

Lys Tyr Tyr Thr Asn Asn Phe He 
550 

Ser Leu Trp He Pro Gly Thr Ala 

565 

Leu Asn Asn Thr His His Ser Leu 

585 

Ser Arg Ala Ser Asn Ser Ala Val 
600 

Val Glu Arg Asp Ser Leu His Phe 

615 

Asn Val Lys Gin Gly Phe Tyr Pro 

630 

Thr Val Glu Pro Glu Thr Gly Asp 
645 

Asp Gly Ala Gly Ala Asp Val He 

665 

Tyr Phe Phe Ser Phe Ala Ala Asn 
680 

Val Asn His Ser Pro Ser He Ser 
695 

Ser His Ala Met Tyr Val Pro Gly 
710 ' 
Met Asn Ala Pro Arg Lys Ser Val 

^""^ 7!. ^^^ ^9 val ser Ser 

745 

Val Pro Ala Gly Pro His Pro Asp 
i . 760 
Asp Leu Glu Ala" Val Asn Arg Arg 
775 

Thr Trp Arg Arg Leu 
790 



350 

Gin Leu His Gin He Asn 
365 

Tyr Leu Pro Thr Thr Val 
380 

Gly Leu Lys Lys Gly Phe 

m 400 
Tyr Gly Ser Val Met He 

415 

Gly Asn Cys Leu Pro Thr 
430 

He Ala Leu Gly Ser Ser 
445 

Leu Thr Gly Gly Leu Lys 
460 

Ser Met He Asp Ala Phe 
480 

Phe Gin Gin His He Gin 

4 95 

His His Gin Leu Lys Asn 
510 

Asp Thr Met Phe Leu Val 

525 ' 

He Leu Phe Asp Pro Asp 
540 

Thr Asn Leu Thr Phe Ara 
560 

Lys Pro Gly His Trp Thr 
575 

Gin Ala Leu Lys Val Thr 
590 

Pro Pro Ala Thr Val Glu 
605 

Pro His Pro Val Met He 
620 

He Leu Asn Ala Thr Val 

640 

Pro Val Thr Leu Arg Leu 
655 

Lys Asn Asp Gly He Tyr 
670 

Gly Arg Tyr Ser Leu Lys 
685 

Thr Pro Ala His Ser He 
700 

Tyr Thr Ala Asn Gly Asn 

Z?^ 720 
Gly Arg Asn Glu Glu Glu 

735 

Gly Gly Ser Phe Ser Val 
750 

Val Phe Pro Pro Cys Lys 
765 

Gly He Asp Pro He Leu 
780 
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<210> 171 

<211> 1491 

<212> DNA 

<213> Homo sapiens 

<400> 171 

cctcctgcca gccaagtgaa gacatgctta cttccccttc accttccttc atgatgtggg 60 
aagagtgctg caacccagcc ctagccaacg ccgcatgaga gggagtgtgc cgagggcttc 120 
tgagaaggtt tctctcacat ctagaaagaa gcgcttaaga tgtggcagcc cctcttcttc 180 
aagtggctct tgtcctgttg ccctgggagt tctcaaattg ctgcagcagc ctccacccag 240 
cctgaggatg acatcaatac acagaggaag aagagtcagg aaaagatgag agaagttaca 300 
gactctcctg ggcgaccccg agagcttacc attcctcaga cttcttcaca tggtgctaac 360 
agatttgttc ctaaaagtaa agctctagag gccgtcaaat tggcaataga agccgggttc 420 
caccatattg attctgcaca tgtttacaat aatgaggagc aggttggact ggccatccga 480 
agcaagattg cagatggcag tgtgaagaga gaagacatat tctacacttc aaagctttgg 540 
agcaattccc atcgaccaga gttggtccga ccagccttgg aaaggtcact gaaaaatctt 600 
caattggact atgttgacct ctatcttatt cattttccag tgtctgtaaa gccaggtgag 660 
gaagtgatcc caaaagatga aaatggaaaa atactatttg acacagtgga tctctgtgcc 720 
acatgggagg ccatggagaa gtgtaaagat gcaggattgg ccaagtccat cggggtgtcc 780 
aacttcaacc acaggctgct ggagatgatc ctcaacaagc cagggctcaa gtacaagcct 840 
gtctgcaacc aggtggaatg tcatccttac ttcaaccaga gaaaactgct ggatttctgc 900 
aagtcaaaag acattgttct ggttgcctat agtgctctgg gatcccatcg agaagaacca 960 
tgggtggacc cgaactcccc ggtgctcttg gaggacccag tcctttgtgc cttggcaaaa 1020 
aagcacaagc gaaccccagc cctgattgcc ctgcgctacc agctgcagcg tggggttgtg 1080 
gtcctggcca agagctacaa tgagcagcgc atcagacaga acgtgcaggt gtttgaattc 1140 
cagttgactt cagaggagat gaaagccata gatggcctaa acagaaatgt gcgatatttg 1200 
acccttgata tttttgctgg cccccctaat tatccatttt ctgatgaata ttaacatgga 1260 
gggcattgca tgaggtctgc cagaaggccc tgcgtgtgga tggtgacaca gaggatggct 1320 
ctatgctggt gactggacac atcgcctctg gttaaatctc tcctgcttgg cgacttcagt 1380 
aagctacagc taagcccatc ggccggaaaa gaaagacaat aattttgttt ttcattttga 1440 
aaaaattaaa tgctctctcc taaagattct tcacctaaaa aaaaaaaaaa a 1491 

<210> 172 
<211> 364 
<212> PRT 
<213> Homo sapiens 



<400> 172 



Met Trp 


Gin 


Pro 


Leu 


Phe 


Phe 


Lys 


Trp 


Leu 


Leu 


Ser 


Cys 


Cys 


Pro Gly 


1 






5 










10 










15 


Ser Ser 


Gin 


He 


Ala 


Ala 


Ala 


Ala 


Ser 


Thr 


Gin 


Pro 


Glu 


Asp 


Asp He 






20 










25 










30 


Asn Thr 


Gin 


Arg 


Lys 


Lys 


Ser 


Gin 


Glu 


Lys 


Met 


Arg 


Glu 


Val 


Thr Asp 




35 










40 










45 




Ser Pro 


Gly 


Arg 


Pro 


Arg 


Glu 


Leu 


Thr 


He 


Pro 


Gin 


Thr 


Ser 


Ser His 


50 










55 










60 








Gly Ala 


Asn 


Arg 


Phe 


Val 


Pro 


Lys 


Ser 


Lys 


Ala 


Leu 


Glu 


Ala 


Val Lys 


65 








70 










75 








80 


Leu Ala 


He 


Glu 


Ala 
85 


Gly 


Phe 


His 


His 


He 
90 


Asp 


Ser 


Ala 


His 


Val Tyr 
95 


Asn Asn 


Glu 


Glu 
100 


Gin 


Val 


Gly 


Leu 


Ala 
105 


He 


Arg 


Ser 


Lys 


He 
110 


Ala Asp 


Gly Ser 


Val 
115 


Lys 


Arg 


Glu 


Asp 


He 
120 


Phe 


Tyr 


Thr 


Ser 


Lys 
125 


Leu 


Trp Ser 


Asn Ser 


His 


Arg 


Pro 


Glu 


Leu 


Val 


Arg 


Pro 


Ala 


Leu 


Glu 


Arg 


Ser Leu 


130 










135 










140 






Lys Asn 


Leu 


Gin 


Leu 


Asp 


Tyr 


Val 


Asp 


Leu 


Tyr 


Leu 


He 


His 


Phe Pro 
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150 

155 



V.I se. V.I p„ ,eo 
Ly= u. P., ^ ^« „3 

L,, Cy. ^. j„ 

„U ^„ ^„ ^ 20, 

vu cv. ^„ 

L,. ^ ^^^^ 

Tyr Ser Ala Leu Gly Ser Hio ttr-r. ^i i^? 255 

260 ^ ^'^'^ P~ Trp Val Asp Pro Asn 

Ser Pro Val Leu Leu Glu Asn Pr-n vlf t ^ 270 

275 Lys Lys 

His Lys AT. Pro Ala Leu He Ala Leu Ar. Xyr III .eu 31„ Arg 
Val Val Val .u Ala Ly. Se. .yr Asn Olu 11^ Ar. xie Ar. Gin 
Asn val Gin Val P.e Glu P.e Gl„ Leu ^r III gIu GIu Met Lys 111 
lie ASP Gly Leu AS. AT, Asn Val Ar. .eu Xhr Leu Asp S p.e . 

Ala Gly Pro Pro Asn Tyr Pro Phe Ser Asp Glu Tyr 
•^^^ 360 



350 



<210> 173 
<211> 1988 
<212> DNA 
<213> Homo sapiens 

<400> 173 



cgggagecgc ctccccgcgg cctcttcnr-*- ^-1-4.^4. 

tctctgctgt cgcccgtccc gcgcgS^S ccSfcS ^^^^^cgct cgcaggccac 60 
ccgcgccgcc cgtcaacatg Itccgctgcg gcSoacS ^^^^^^ccgct ccgctcggcc 120 
tgcccctgct cctactcagc gcca?cg?c? ?cSSS? ^^f cgctggatcc 180 
ggttgcagtc tagcgaccac ggccagLgt SS^^^ <=gcgctggcc ggccgcggct 240 
gcggcggcag ogggtcctac gaggagggS q^aaS.S ^^^gaaatgc tcccaagagg 300 
gagcagcggc tgccatgcto ttc^ggct ?catca?^^ <=atggagtac gcgtggggta 360 
ccttcttcgc cctctgtgga ccccIgSgc ttaloTtcTt ^^^^*^*^5t ttcatcctct 420 
ttgcottggc tgctgtgttc cagatLtc? ccSSL^^ gagagtgatt ggaggtctcc 480 
agaocttcac ccttcatgcc aaccctgctg Sc^Sc^J ttaccccgtg aagtacaccc 540 
ttgggtgggc agccacgatt atcctgatcg gcJSano^^ ctataactgg gcctacggct 600 
actacgaaga tgaccttctg ggcaa?gccf agcSoSJ 2^''!''''^^ tgcctcccca 660 
ttgggaatga atgtgggaga aaatcgctgc StS^S^ cttctacaca tctgcctaac 720 
tttctccagg cgactttgaa cccatttttt ggcaotaS^ f^'^f^fS^^ gaagaaactg 780 
aatgctaaaa taatttggga gaaaatattt ??t2ataaf aotagtcaaa 840 

ttttattatg ttttgtgaag ttgtgtcttt t^^^^^ !H gttatagttt catgtttatc 900 
ccttatatct atccataaca ttStactac aSSt'Saa ^^^^^^^t^* gccaatattt 960 
ttt2^!%'^ ^ggtaaaaat gaggtttcca ZlHttltl ItT''^'^'' gtgaaactta 1020 
tttccaaata gaatggactt ggtctgttaa qaocta«n« ^tctgatcaa gttcttgtta 1080 
aagttgttaa tgaccaaaca ttctaaaagS lltTcltttt ^^^^aggaag ataaggttaa 1140 
tcgaactatt taaggaaagc aaaatcat?t cctaSta^ aaaagtttat tttcaagcct 1200 
cattaatato ctgaatcatt catttcagc^ SoctS^ tatcatttgt gagaatttct 1260 
aggaaagtac tatttcatgg tccaaacrtg ttll^lTt ^ f ^'"^^^ tatgtcatct 1320 

g T:tgccatagt tggtaaggct ttcctttaag 1380 
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tgtgaaatat ttagatgaaa ttttctcttt 

aaatgctata ttaataaatc tgtagtgttt 

tggattgaaa gatggactgg gtctaattta 

agtaaagcat taggagggtc attcytgtca 

ataaatgact tgcttttcta aatctcaggt 

ctgatagttt gcarctgtaa gcagaaacct 

taaacagatt ttaaatgtct gatataaaac 

tctctgaata gcatatatat gatgcatcgg 

cttacataat gaaaaccaat tcattttaaa 

aaagctaatt gtagttttca ttatgaagtt 
aaaaaaaa 



taaagttctt tatagggtta gggtgtggga 1440 
tgtgtttata tgttcagaac cagagtagac 1500 
tcatgactga tagatctggt taagttgtgt 1560 
caaaagtgcc actaaaacag cctcaggaga 1620 
ttatctgggc tctatcatat agacaggctt 1680 
acatatagtt aaaatcctgg tctttcttgg 1740 
atgccacagg agaattcggg gatttgagtt 1800 
ataggtcatt atgatttttt accatttcga 1860 
tatcagatta ttattttgta agttgtggaa. i920 
ttcccaataa accaggtatt ctaaaaaaaa- 1980 

1988 



<210> 174 
<211> 238 
<212> PRT 
<213> Homo sapiens 



<400> 174 



Gly 


Ala 


Ala 


Ser Pro 


Arg 


Prrt Levi 


1 






5 






Arg 


Arg 


Pro 


Leu Ser 


Ala 


Val Ala 








20 






Leu 


Arg 


Ser 


Ala Pro 


Leu 


Gly Pro 






35 






40 


Cys 


Gly 


Leu 


Ala Cys 


Glu 


Arg Cys 




50 








55 


Leu 


Ser 


Ala 


He Ma 


Phe 


Asp He 


65 








70 




Leu 


Gin 


Ser 


Ser Asp 


His 


Gly Gin 








85 






Ser 


Gin 


Glu 


Gly Gly Gly 


Ser Gly 








100 






Leu 


Met 


Glu 


Tyr Ala 


Trp 


Gly Arg 






115 






120 


Gly 


Phe 


He 


He Leu 


Val 


He Cys 




130 








135 


Cys 


Gly 


Pro 


Gin Met 


Leu 


Val Phe 


145 








150 




Ala 


Leu 


Ala 


Ala Val 


Phe 


Gin He 








165 






Lys 


Tyr 


Thr 


Gin Thr 


Phe 


Thr Leu 








180 






He 


Tyr 


Asn 


Trp Ala 


Tyr 


Gly Phe 






195 






200 


He 


Gly 


Cys 


Ala Phe 


Phe 


Phe Cys 




210 








215 


Leu 


Leu 


Gly 


Asn Ala Lys 


Pro Arg 


225 








230 





Arg Phe Cys 


Gly 


Gly 


Ala 


Arg 


Ala 


10 








15 




Arg Pro Ala 


Arg 


Ser 


Ser 


Asp 


Pro 


25 






30 






Ala Pro Pro 


Val 


Asn 


Met 


He 


Arg 






45 






Arg Trp He 


Leu 
60 


Pro 


Leu 


Leu 


Leu 


He Ala Leu 


Ala 


Gly 


Arg 


Gly 


Trp 


75 










80 


Thr Ser Ser 


Leu 


Trp 


Trp 


Lys 


Cys 


90 








95 




Ser Tyr Glu 


Glu 


Gly 


Cys 


Gin 


Ser 


105 






110 






Ala Ala Ala 


Ala 


Met 
125 


Leu 


Phe 


Cys 


Phe He Leu 


Ser 
140 


Phe 


Phe 


Ala 


Leu 


Leu Arg Val 


He 


Gly 


Gly 


Leu 


Leu 


155 










160 


He Ser Leu 


Val 


He 


Tyr 


Pro 


Val 


170 








175 




His Ala Asn 


Pro 


Ala 


Val 


Thr 


Tyr 


185 






190 




Gly Trp Ala 


Ala 


Thr 

205 


He 


He 


Leu 


Cys Leu Pro 


Asn 
220 


Tyr 


Glu 


Asp 


Asp 


Tyr Phe Tyr 


Thr 


Ser 


Ala 







235 



<210> 175 

<211> 4181 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<222> 3347, 3502, 3506, 3520, 3538, 3549, 3646, 3940, 3968, 3974, 
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<223> n = A,T,C or G 

<400> 175 



SSalS ttctctttta aaacacatct 60 

ttactgtgtt tgtgtat?tt SggcaS Jaac^f <=tcgttcttg tgcttggtS 120 
tccatcaccg tgggtggttt taamttS t??t?tct?a fff^'^'^^ gctggllLa 180 
actcttcaca atgaacaaac tgtatatcqa aaac^J^S "^'^^ttttt ttaaacaacc 240 
cctagaaagt atcttcaagg acqccaaSt 9agaacgccg ccccctcgga 300 

tggctacgcg ttcgtggac? gScggalga gagSSc? ^?r=<="- tggtgaagac 360 
ttcaggtaaa atagaactgc acggSaacc dl^lUtt f ^^«55<=<=^ tcgaggcgct 420 
gcaaaggatt cggaaacttc agatacoaaa ^IntT^l 9agcactcgg tcccaaaaag 480 
ggatagttta ctagtccagt atggSt gaaaaactot '^^^^^'^^^^ gggaggtgc? 540 
ggaaactgca gttgtaaatg taIcoS??c ^atllllf ^^^^^f ^^ga acactgactc 600 
caaactgaat ggatttcagt tagagaa"t ScrttofL ^^^ctagac aagcactaga 660 
allTT'' *=^9caaaacc crttgcag^a J^SSt Sof''*^*^ tccctgatga 720 
gggctcctca aggcaggggt ctccaggltc StSSffa ^f^^^^^'^gc ttgggcagag 780 
tctgcgcctg ctggttccca cccaatttgt taaa«^^? cagaaaccat gtgatttgcc 840 
cattcggaac atcaccaaac agaccSg^o lllll^calt ^^^gaaaag aaggtgccac 900 
gggggctgct gagaagtcga ttactatcct ct^f ! gtccaccgta aagaaaatgc 960 
taagtctatt ctggagatta tgSagga agSalS f^'^^.T' -^gcggcttg 1020 
Taatc^f attttagctc ataataaSt JgSgS StattT "^aagagat 1080 
aaatcttaaa aaaattgagc aagacacaga cactaS^S f ^^"99ta aagaaggaag 1140 
attgacgctg tataatccag aacgcactS tar!^^^ acgatatctc cattgcagga 1200 
caaagctgag gaggagatca tgaag^alJt cattlllt^^ ggcaatgttg agacatg?gc 1260 
tatgaatctt caagcacatt tLt?cc?gg Ttlllttltl *f ^^!f!*5 atattgcttc 1320 
acccacttca gggatgccac ctcccacc?c ajaaccc^!? aacgccttgg gtctgttccc 1380 
cccgcagttt gagcaatcag aaacggajc trttcatcaa J^f^^^^^^ ctcctcccta 1440 
cggtgccatc atcggcaagc agqqclaoea nf^! ^ tttatcccag ctctatcagt 1500 
ttcaattaag attgctcclg cgSScJ aaaS^?f ^ ctttctcgct ttgctgga?^ iseS 
tggaccacca gaggctcag? tSagJoS gJoaS^f? fgaggatgg tgattatcic 1620 
aaactttgtt agtcctaaag aaqaaato^ Sggaagaatt tatggaaaaa ttaaagaaga 1680 
tgctgctggc agagttattg gSSgSg caaSta ^^^^^^^ tgcca?cc?t 1740 
aagtgcagaa gttgttgtcc ItcgtgacS TaTactalt ^^^^^^c agaatttgtc 1800 
caaaataact ggtcacttct atgctLcS qa^^^^f ^agaatgacc aagtggttgt 1860 
f^r'f " ^-5-5--- aacaacagS ggcSqSa aS^f'^^ ^^^^-^ttJt 1920 
acggaagtaa aggctcagga aacagcccac S2qaq«^« agtggaccac ctcagtcaag 1980 
cSS*'\^? aacagatggg cgctgacSc SS^^a tat^T^^'' ^aagacaga 2040 
ctagccagtt gtttctgagg accaggcaac Sf^^^Ef tcacatgcac aagtttttac 2100 
actttatgct ctctgaaa?g ta^Saccc ScStL ^^^^^^^^^gt gagaatgtat 2160 
aaaagggtgg gggagggagg gaaagaqaaq tl^ll ^ caaacaaaca aacaaacaaa 2220 
cagtataaca gatattrtS ?tc?tc?S tEScf'' "^^'^"^ tgtagtctca 2280 
atgatgcttt cactaaattc atcaaa^aga tTcSlTttt "f^^'^^^^^ aattggctta 2340 
^tt .ttt ^^^^^gga acttaaat^ ufJ^Sta aL'^^"^ taaaattgga 2400 
gttttatttt taoctaacac taacatgagt aacct^^^^ g^atagaaaa actgttctca 2460 
aggggtatta aacgtgcatt tttactL!^ aacctaaggg aagtgctgaa tggtgttqqc 25?0 
gcaaaattgt tccttt^ttj t^a'Jaltm^ a^aSc^S Ja'at^^^^ 
ttttttaaaa aataaattta aaatttaaca qcaatca^^^ l^^^^^^^^^ agtccaaccg 2640 
tacttctggc tggtgacagt aaagctqq^ aacaggcaaa ttaagatttt 2700 

acacagttat tagttaaatc aaaf^Sa ^llltll^" ^^^^^tttt gaggcttttg 2760 
gcagcactac catttattct ttcamSa attln^^ '^ cagtgcctag tatctggaga 2820 
gtggtcgcag gagattttgg aacgg^gg? lUlToZt tactaaLL 2880 

f ^ atgattgaat gcataatlfa tgctttq?q^ J^^^gagact tcagtttttt 2940 
agaaagtgca tcagtgaaga gatqcaaoan ^^r^^!^ ttctgactat caatacctaa 3000 
gcttgtctta taggatgrtt Igtltglltl tlltttf^'' ^tggcaaaaa gcaagctttj 306? 
tggtgtgaca gtgtttaaac gcaacSaaq q^t^o^^^f ' accaatggga cagtcataga 3120 
gagcctcact aagctatttt ^agatSt? LgSgat a"a1.T'='=^ ^cactgtcft 3180 

ywctgat aaattaaaaa aaaaaaaaaa 3240 
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aaattagact ccaccttaag tagtaaagta taacaggatt tctgtatact gtgcaatcag 3300 

ttctttgaaa aaaaagtcaa aagatagaga atacaagaaa agttttnggg atataatttg 3360 

aatgactgtg aaaacatatg acctttgata acgaactcat ttgctcactc cttgacagca 3420 

aagcccagta cgtacaattg tgttgggtgt gggtggtctc caaggccacg ctgctctctg 3480 

aattgatttt ttgagttttg gnttgnaaga tgatcacagn catgttacac tgatcttnaa 3540 

ggacatatnt tataaccctt taaaaaaaaa atcccctgcc tcattcttat ttcgagatga 3600 

atttcgatac agactagatg tctttctgaa gatcaattag acattntgaa aatgatttaa 3660 

agtgttttcc ttaatgttct ctgaaaacaa gtttcttttg tagttttaac caaaaaagtg 3720 

ccctttttgt cactggtttc tcctagcatt catgattttt ttttcacaca atgaattaaa 3780 

attgctaaaa tcatggactg gctttctggt tggatttcag gtaagatgtg tttaaggcca 3840 

gagcttttct cagtatttga tttttttccc caatatttga ttttttaaaa atatacacat 3900 

aggagctgca tttaaaacct gctggtttaa attctgtcan atttcacttc tagcctttta 3960 

gtatggcnaa tcanaattta cttttactta agcatttgta atttggagta tctggtacta 4020 

gctaagaaat aattcnataa ttgagttttg tactcnccaa anatgggtca ttcctcatgn 4080 

ataatgtncc cccaatgcag cttcattttc caganacctt gacgcaggat aaattttttc 4140 

atcatttagg tccccaaaaa aaaaaaaaaa aaaaaaaaaa a 4181 

<210> 176 
<211> 579 
<212> PRT 

<213> Homo sapiens 



<400> 176 



Met Asn 


Lvs 


Leu 


Tvr 


He 


Glv 


Asn 


Leu 


Ser 


Glu A<5n AT A Ala Ptd Re^r 


1 






5 










10 


15 


Asp Leu 


Glu 


Ser 


He 


Phe 


Lys 


Asp 


Ala 


Lys 


He Pro Val Ser Gly Pro 






20 










25 




30 


Phe Leu 


Val 


Lys 


Thr 


Gly 


Tyr 


Ala 


Phe 


Val 


Asp Cys Pro Asp Glu Ser 




35 










40 






45 


Trp Ala 


Leu 


Lys 


Ala 


He 


Glu 


Ala 


Leu 


Ser 


Gly Lys He Glu Leu His 


50 










55 








60 


Gly Lys 


Pro 


He 


Glu 


Val 


Glu 


His 


Ser 


Val 


Pro Lys Arg Gin Arg He 


65 








70 










75 80 


Arg Lys 


Leu 


Gin 


He 


Arg 


Asn 


He 


Pro 


Pro 


His Leu Gin Trp Glu Val 








85 










90 


95 


Leu Asp 


Ser 


Leu 


Leu 


Val 


Gin 


Tyr 


Gly 


Val 


Val Glu Ser Cys Glu Gin 






100 










105 




110 


Val Asn 


Thr 


Asp 


Ser 


Glu 


Thr 


Ala 


Val 


Val 


Asn Val Thr Tyr Ser Ser 




115 










120 






125 


Lys Asp 


Gin 


Ala 


Arg 


Gin 


Ala 


Leu 


Asp 


Lys 


Leu Asn Gly Phe Gin Leu 


130 










135 








140 


Glu Asn 


Phe 


Thr 


Leu 


Lys 


Val 


Ala 


Tyr 


He 


Pro Asp Glu Met Ala Ala 


145 








150 










155 160 


Gin Gin 


Asn 


Pro 


Leu 


Gin 


Gin 


Pro 


Arg 


Gly 


Arg Arg Gly Leu Gly Gin 








165 










170 


175 


Arg Gly 


Ser 


Ser 


Arg 


Gin 


Gly 


Ser 


Pro 


Gly 


Ser Val Ser Lys Gin Lys 






180 










185 




190 


Pro Cys 


Asp 


Leu 


Pro 


Leu 


Arg 


Leu 


Leu 


Val 


Pro Thr Gin Phe Val Gly 




195 










200 






205 


Ala lie 


lie 


Gly 


Lys 


Glu 


Gly 


Ala 


Thr 


He 


Arg Asn He Thr Lys Gin 


210 










215 








220 


Thr Gin 


Ser 


Lys 


He 


Asp 


Val 


His 


Arg 


Lys 


Glu Asn Ala Gly Ala Ala 


225 








230 










235 240 


Glu Lys 


Ser 


He 


Thr 


He 


Leu 


Ser 


Thr 


Pro 


Glu Gly Thr Ser Ala Ala 








245 










250 


255 


Cys Lys 


Ser 


He 


Leu 


Glu 


He 


Met 


His 


Lys 


Glu Ala Gin Asp He Lys 






260 










265 




270 


Phe Thr 


Glu 


Glu 


He 


Pro 


Leu 


Lys 


He 


Leu 


Ala His Asn Asn Phe Val 
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Gly Arg Leu lie Gly Lvs n„ ^i , , ^85 

290 ^ ^ 295 ^ 

^ Thr ASP rhr Lys lie He Ser P.o Leu oiu 
oyr A3n Pro Glu g Th. He ^ val Lys Gl'y Asn Val 
Ala Lys Ala Glu Glu Glu He Met Lys Lys' He Arg Glu 
^ ASP lie Ala Ser Met Asn Leu Gl'n Ala His Leu He 
Asn Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser G^ 
Pro Thr ser Gly Pro Pro Ser Ala Met Thr Pro III Tyr 
Glu Gin ser Glu Thr Glu Thr Val His Gin Phe He Pro 
Val Gly Ala He He Gly Lys Gin Gly JL' hIs- He Lys 
-g Phe Ala Gly Ala Ser He Lys til Ala Pro Ala Glu 

Ala Lys Val Arg Met Val He Al Thr n t> 

450 ® ™^ *^^y Pro Pro Glu 

jya Ala Gin Gly Arg He Tyr Gly Lys He Lys cll Glu 
ser Pro Lys Glu- Glu Val Lys Leu Glu Ala Ms He Arg 
Phe Ala Ala Gly Arg Val He Gly Lys ^ Gly Lys Thr 
I-u Gin Asn Leu Ser Ser Ala Glu Val Val Val Pro Arg 
Pro ASP Glu Asn Asp Gin Val 'vH Val Lys He Thr 

cys Gin Val Ala Gin Arg Lys He Gin Glu H^ Leu 
^ys Gin His Gin Gin Gin Lys Ala Leu Gin ler Gly Pro 
Arg Arg Lys 



He Glu Gin 

Leu Thr Leu 
320 

Glu Thr Cys 

335 
Ser Tyr Glu 
350 

Pro Gly Leu 

Met Pro Pro 

Pro Gin Phe 
400 

Ala Leu Ser 

415 
Gin Leu Ser 
430 

Ala Pro Asp 
Ala Gin Phe 



Asn Phe Val 
480 

Val Pro Ser 

495 
Val Asn Glu 
510 

Asp Gin Thr 

His Phe Tyr 

Thr Gin Val 
560 

Pro Gin Ser 
575 



<210> 177 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<400> 177 



atgccccgta aatgtcttca atrTi-+-r'^-+- 

agatccaaac aaatacacat tctqtqtttt tn^Tl^? ^atctcaaaa gatttggttc 60 
cacacagcaa aaaattgttt actttSJ^^ ^gctcagtgt tttctaaaaa aagaaactgc 120 
ggtgcttata aaaagt?a^a aaS^cgaS aactctf ^-^"-tca aaaaatga« HI 
gaagtgagct tgtgcttagt aS?acat?J Ittlnl '^^^^'^"cct gaccaagagg 240 
gcaaactggt gcagaaattc tataaacS? tf ^ff^^^ ttgtaatcac tgacttltgt 300 
attttgtttt gttttgtaaa aatgSa'aa,' c^?ca?aaaa ^^^^^-^^^ tttttgtt?c 360 

<210> 178 
<211> 561 
<212> DNA 
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<213> Homo sapiens 



<400> 178 

acgcctttca agggtgtacg caaagcactc attgataccc ttttggatgg ctatgaaaca 60 

gcccgctatg ggacaggggt ctttggccag aatgagtacc tacgctatca ggaggccctg 120 

agtgagctgg ccactgcggt taaagcacga attgggagct ctcagcgaca tcaccagtca 180 

gcagccaaag acctaactca gtcccctgag gtctccccaa caaccatcca ggtgacatac 240 

ctcccctcca gtcagaagag taaacgtgcc aagcacttcc ttgaattgaa gagctttaag 300 

gataactata acacattgga gagtactctg tgacggagct gaaggactct tgccgtagat 360 

taagccagtc agttgcaatg tgcaagacag gctgcttgcc gggccgccct cggaacatct 420 

ggcccagcag gcccagactg tatccatcca agttcccgtt gtatccagag ttcttagagc 480 

ttgtgtctaa agggtaattc cccaaccctt ccttatgagc atttttagaa cattggctaa 540 

gactattttc ccccagtagc g 561 

<210> 179 
<211> 521 
<212> DNA 

<213> Homo sapiens 
<400> 179 

cccaacgcgt ttgcaaatat tcccctggta gcctacttcc ttacccccga atattggtaa 60 
gatcgagcaa tggcttcagg acatgggttc tcttctcctg tgatcattca agtgctcact 120 
gcatgaagac tgg.cttgtct cagtgtttca acctcaccag ggctgtctct tggtccacac 180 
ctcgctccct gttagtgccg tatgacagcc cccatcaaat gaccttggcc aagtcacggt 240 
ttctctgtgg tcaaggttgg ttggctgatt ggtggaaagt agggtggacc aaaggaggcc 300 
acgtgagcag tcagcaccag ttctgcacca gcagcgcctc cgtcctagtg ggtgttcctg 360 
tttctcctgg ccctgggtgg gctagggcct gattcgggaa gatgcctttg cagggagggg 420 
aggataagtg ggatctacca attgattctg gcaaaacaat ttctaagatt tttttgcttt 480 
atgtgggaaa cagatctaaa tctcatttta tgctgtattt t 521 



<210> 180 

<211> 417 
<212> DNA 

<213> Homo sapiens 
<400> 180 



gcttgatggt cgaggccatc 60 
gctgggccgg aaggtggtgg 120 
cagaaacaag ttgaagtacc 180 
aggcccctac cacttccggg 240 
gccccacaaa accaagcgag 300 
cccaccgccc tacgacaaga 360 
tctgaagcct acaagaa 417 



ggtggaattc gccgaagatg 

tcctgggccg cctggcggcc 

tcgtacgctg tgaaggcatc 

tggctttcct ccgcaagcgg 

cccccagccg catcttctgg 

gccaggccgc tctggaccgt 

aaaagcggat ggtggttcct 

<210> 181 
<211> 283 . 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 35 

<223> n « A,T,C or G 



gcggaggtgc aggtcctggt 
atcgtggcta aacaggtact 
aacatttctg gcaatttcta 
atgaacacca acccttcccg 
cggaccgtgc gaggtatgct 
ctcaaggtgt ttgacggcat 
gctgccctca aggtcgtgcg 



<400> 181 

gatttcttct aaataggatg taaaacttct 

caagaactca agtgtaactg tgataaaata 

tgtaatctca gaatacacag gtgacataga 



ttcanattac tcttcctcag tcctgcctgc 60 
acctttccca ggtatattgg caggtatgtg 120 
tatgatatga caactggtaa tggtggattc 180 
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SaS^' ^t't'XS? alS^^S S^'lll" .-^^tcagttt ttt.aa.a.c «0 

<210> 182 ^^'^ 
<211> 401 
<212> DMA 

<213> Homo sapiens 
<400> 182 

il^qi: iTtlZlZ t\\™ aa^a.cctt eo 

agaggattga gtaagtagtt ggatggrttt cataa2!r» gcagatttca 120 

atgctttaag aaacatttgt tltaS^Scc tSSaaaJta "^^=^^9 aagaggattc 180 
tagcaggcag tgtgttttcc ttccatgtct c^^foc^Jf ^^^^^gggat aaaaactatg 240 
gctgcaagtc tgtcctatct gaattcSJo T.nlt t '='=t5cagtgt gtcctctgag 300 
ctagcagata aaactatggg gaall^ttl SSgt'g^l f^""^''*- accctatL? 36^ 



<210> 183 
<211> 366 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 
<222> 325 

<223> n = A,T,C or G 
<400> 183 



accgtgtcca agtttttaga acccttati-^ 

accatcatgc tttgatgtL ccctlt^St^ SttS f cctgg tcaccgtttc 60 
tttaaggaca aagatgaagt cactotaaa^ tl^ l f^ ctctcaagag caaaggttaa 120 
tttttcagtg cagaaattS aagJS^at aaa^S^"": "^""tac cttcc?tttc 180 
gtgtcggaat cactggtaaa tg?Jgggga gaaSccc T^LT^" ^tttttgcgt 240 
cactttgagc gctttaagag attan?ct|a gaaSaaS l^tTJ^T ^^^^g^^^a 300 
aaaaaa gaaataatta aatatctttt ctcttcaaaa 360 



<210> 184 
<211> 370 
<212> DNA 
<213> Homo sapiens 

<400> 184 



ttlTiill li~ caggaaaaat SO 

taaaatgtta gtctacatag atggSgatt aSac^^f ^ gcagatcaaa tatttggagt 120 
ttgcattcat gcttctgtg? acJ^ataa^g lllTtllll '^^T^*^^^ agatttcaaa 180 
tcagtctgct ctgtttaatt ctgctgtSg ctctSnf ^^^^^9^^ gatctctcct 240 
cacagtttag tgatatctag gagtaSS? t^rt^^^ atgc±gcgtc cctaattgta 300 
ggtttaaaaa ' gi^ataaag ttgtcgccca tcaataaaaa tcacaaagtt 360 

<210> 185 ■ 



<211> 107 
<212> DNA 
<213> Homo sapiens 

<400> 185 



SS^^'S Itl^l^ ~?e -S"' 



60 
107 
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<210> 186 

<211> 309 

<212> DNA 

<213> Homo sapiens 

<400> 186 

gaaaggatgg ctctggttgc cacagagctg ggacttcatg ttcttctaga gagggccaca 60 

agagggccac aggggtggcc gggagttgtc agctgatgcc tgctgagagg caggaattgt 120 

gccagtgagt gacagtcatg agggagtgtc tcttcttggg gaggaaagaa ggtagagcct 180 

ttctgtctga atgaaaggcc aaggctacag tacagggccc cgccccagcc agggtgttaa 240 

tgcccacgta gtggaggcct ctggcagatc ctgcattcca aggtcactgg actgtacgtt 300 

tttatggtt 309 

<210> 187 

<211> 477 

<212> DNA 

<213> Homo sapiens 

<400> 187 

ttcagtccta gcaagaagcg agaattctga gatcctccag aaagtcgagc agcacccacc 60 
tccaacctcg ggccagtgtc ttcaggcttt actggggacc tgcgagctgg cctaatgtgg 120 
tggcctgcaa gccaggccat ccctgggcgc cacagacgag ctccgagcca ggtcaggctt 180 
cggaggccac aagctcagcc tcaggcccag gcactgattg tggcagaggg gccactaccc 240 
aaggtctagc taggcccaag acctagttac ccagacagtg agaagcccct ggaaggcaga 300 
aaagttggga gcatggcaga cagggaaggg aaacattttc agggaaaaga catgtatcac 360 
atgtcttcag aagcaagtca ggtttcatgt aaccgagtgt cctcttgcgt gtccaaaagt 420 
agcccagggc tgtagcacag gcttcacagt gattttgtgt tcagccgtga gtcacac 477 

<210> 188 
<211> 220 
<212> DNA 
<213> Homo sapiens 

<400> 188 

taaatatggt agatattaat attcctctta gatgaccagt gattccaatt gtcccaagtt 60 
ttaaataagt accctgtgag tatgagataa attagtgaca atcagaacaa gtttcagtat 120 
cagatgttca agaggaagtt gctattgcat tgattttaat atttgtacat aaacactgat 180 
ttttttgagc attattttgt atttgttgta ctttaatacc 220 

<210> 189 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 76, 77 

<223> n = A,T,C or G 

<400> 189 

accatcttga cagaggatac atgctcccaa aacgtttgtt accacactta aaaatcactg 60 
ccatcattaa gcatcnnttt caaaattata gccattcatg atttactttt tccagatgac 120 
tatcattatt ctagtccttt gaatttgtaa ggggaaaaaa aacaaaaaca aaaacttacg 180 
atgcactttt ctccagcaca tcagatttca aattgaaaat taaagacatg ctatggtaat 240 
gcacttgcta gtactacaca ctttgtacaa caaaaaacag aggcaagaaa caacggaaag 300 
agaaaagcct tcctttgttg gcccttaaac tgagtcaaga tctgaaatgt agagatgatc 360 
tctgacgata cctgtatgtt cttattgtgt aaataaaatt gctggtatga aatgaca 417 
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<210> 190 
<211> 497 

<212> DNA 

<213> Homo sapiens 

<400> 190 



xtii itzzii ^ - 

acggtccgca aggatgccta ca?gttctgg SSat? ^^tatg ggattatgtg 120 
aacttctcag aactgcccct ggtcatataa ItfT.^ atgccaccaa ctcctgcaag 180 
ggatttggaa actttgagga fattgggcc? 2tgS g^^-^^cgg ttctagcac? 240 
acctggctcc aggctgccag tctcctattt nlr,^tT^^ atctcaaacc acggaaaacc 300 
tatgtgaatg gtagtggtgc cStgcSaa S.f ccgtgggcac tgggttcagt 360 
gttctcctga agacc??c?t 2S?g^? !aagaa?2c '^^^^^f^^ agacatgat'g 420 
ttctcagagt cctatgg aaagaattcc agacagttcc attctacatt 480 



<210> 191 
<2H> 175 
<212> DNA 
<213> Homo sapiens 



<400> 191 

SfS'S nssSS l^S^Ttl ITtST' 1'°^^" » 

<210> 192 
<211> 526 
<212> DNA 
<213> Homo sapiens 

<400> 192 

agtaaacatt attatttttt ttai-;:ii-i-4-^^ ^ 

aagaacagta ttgctgtaat' "2t"c?t ToTcTcl t'tlcL^^''^' ttcctataga 60 
attgaagaaa gagaaacttg tcaactcata t^on^^^^ f tttcctctgc cccttaaaag 120 
ctatcactaa gtaatgtato TtaTglTg tSSt^ta S^^"^^ ^cataagaat 180 
tcacaaaatt aaagcaagaa gtccataata a?^;f?^^ ccagtgacac cccatattca 240 
tcagagtttc tgaggtcaaa ?SSt t^!^f^**^^ taatagtgga tttttaatgc 300 
ttacttaatg tatt?tggtg ta^^cc" lalSll^t g<=tctatgat cttaaataat 360 
aattcctctg atcactttga gaaacaSct ttt^Jf ^ tggtgttcaa gactatatct 420 
ttttaaatat aaaaataalt It^Tctt ^St^ctja llltir^^ '"^'^^'^^ 

526 



<210> 193 
<211> 553 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 290, 300, 411, 441 
<223> n = A,T,C or G 

<400> 193 



cagtggtagc agttggactg acStJS c^gSaoatt ^^'''^^'^^CC atggccagL 120 
aagccatgaa gcatatggag cctcaajtal aaS^g'SS SiL'cS ^^^^""^^ ^^O 

yutut rcaaagccta ccaaaatctg 240 
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ccttcagtgg 

cattaatact 
gaattatgct 
atgaagctaa 
ttaagttcgt 
ctacaatttt 



tggctattat 

aggtgtaagc 
tttaaatcat 
agatttacta 
attagtttat 
aaa 



agaggtgggt 
cctactgcca 
cctgacaaag 
naaggtcaag 
gtatatgagt 



ttgaacccaa 
ataaagggaa 
gaggatctcc 
ctaaaaaatg 
actaagtttt 



aatgacaaan 

aataagagat 
ttatatagca 
aagtaaatgt 
tataataaaa 



cgggaagcan 300 
gctcatcgac 360 
nccaaaatca 420 
atgatgaatt 480 
tgcctcagag 540 
553 



•<210> 194 
<211> 320 
<212> DMA 
<213> Homo sapiens 



<400> 194 

cccttcccaa 

atgtcacttg 

attaagactc 

cccatctccg 

ccagaaaaaa 

attgacccat 



tccatcagta 
atatgagaat 
tgataattgt 
tttcatatca 
gtgagcctct 
atttatacct 



aagaccccat 
ctcaaatctc 
ctcccctcca 
gaactaccgt 
tcatctgcac 



ctgccttgtc 
aatgccttat 
taggaatttc 
ccccgatatt 
ctgtaatagt 



catgccgttt 
aagcattcct 
tcccaggaaa 
cccttcagag 
ttcagttcct 



cccaacaggg 60 
tcctgtgtcc 120 
gaaatatatc 180 
agattaaaga 240 
attttcttcc 300 
320 



<210> 195 
<211> 320 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 203, 218 
<223> n = A,T,C or G 



<400> 195 

aagcatgacc 

gtgaccagaa 

aactgtggtg 

ctgagtaaac 

ttgggtaaaa 

tatttggtaa 



tggggaaatg 
tctgccatgg 
ttagcaccag 
ttcttatttt 
gtcttttcca 
attatgaact 



gtcagacctt 
caacaggctt 
ccagctctct 
tanaaagggg 
caaaccacca 



gtattgtgtt 
taaaaaagac 
gtacatttgc 
aggctggntt 
tctattttgt 



tttggccttg 
ccttaaaaag 
tagcttgtag 
gtaactttcc 
gaactttgtt 



aaagtagcaa 60 
acactgtctc 120 
ttttctaaga 180 
ttgtacttaa 240 
agtcatcttt 300 
320 



<210> 196 
<211> 357 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 36 

<223> n = A, T,C or G 



<400> 196 

atataaaata 

tcactttaac 

aatactacaa 

tgatgatatg 

tataaatact 

aaaaaaaaaa 



atacgaaact 
tgtaaacaat 
aaacttattt 
acatctggct 
gtatggacaa 
ttttaagagc 



ttaaaaagca 
ttcttaggac 
atactgttct 
aaaaagaaat 
aaaatggcat 
tggtactaat 



ttggantgtc 
accatttggg 
tatgtcattt 
tattgcaaaa 
tttttatatt 
aaaggattat 



agtatgttga 
ctagtttctg 
gttatattca 
ctaaccacta 
aaattgttta 
tatgactgtt 



atcagtagtt 60 
tgtaagtgta 120 
tagatttata 180 
tgtacttttt 240 
gctctggcaa 300 
aaaaaaa 357 



<210> 197 
<211> 565 
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<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 27 

<223> n = A,T,C or G 
<400> 197 



lllllTtr, '""-^^ag tagaaaacta SO 

tggtcctaca ctttttagga tgc??StaI lllttl^^ ggttattaga tcattcacct 120 
gttcctatat tttgggj^t Sggg?^: l^^Jl^^ tftlT^'' aacatccctg 180 
agaaagtaag cccagggctt cagatctaaa ttlat^^!!! ^^^t ^^""^^ cagcagccct 240 
agttgtaatg ctagglltaa gcIcJctJta a^^Stta'a tttllT'^ tttaaagtca 300 
gaatgtttct gaaacattaa acttgtattt atrtcac?aa lltlT''''^ agoaattagg 360 
aaatgtgtct catacatatg ctatactaaa r^fl^t t ^^^^^^taaca caaacttaaa 420 
atttgaatat atgaaagaa? SjtJcaSa at^^^ff!^ catttctaaa tttgtgtatg 480 
atataatttg tacctaitgt aaJaa ^^St^^ttta aaattattaa aaataaatgt 540 



<210> 198 
<2H> 484 
<212> DMA 
<213> Homo sapiens 

<400> 198 



ctgttggatg tgtccLtgt ^SgS aSatt^'" ^^^^^^^^^c cgacagctga 120 
tgggcgcagc agcaggtggc aggSgtg? ^SaaJataa ^f^^^^"! ggacctccat 180 
tctctggtgc tttctgagag ggtctcSS gtggcagcgt ctggtcctcc 240 

agcacgtatt tc±cccc?ct agtaccSjg cat?tS2^ f tggcctgg gggaaggcag 300 
agggcagcag actcttgagt atacto™ a^^o!f Hf? ^^^^ccctct ggctttctga 360 
tccaggggct caactgacca aSaScfg aS^Sa^ ItT^""^^^ tcctgagg|c 420 
aaac y^aacacag aagttggggt atgtggccta tttgggtcgg 480 

484 

<210> 199 
<211> 429 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

r- f ; n''' 227, 274, 319 

<223> n = A,T,C or G 

<400> 199 

gcttatgttt tttgttttaa cttttcrttti- i-i-=.=^=4.4.^ 

tacagtacct ttctcanaca tttSSnaa Sc^S^ ^aatattaca ttttgtatta 60 
gaacattaaa aagngtgata gcga^a^JJg ngSScaa Sa^f"'" Sr^attttgct 120 
ataaacaana cacaacgttt ttatacaaca tso^ffof f^^gaaaaaa ggtagtctta 180 
attgtttcct attaagtatt attcttta^^ 1 Ittt^ tattaanaaa actcottaat 240 
caatttagca tttgc?ttng TttHHlll otltllll^ t^atgctttt gattttctct 300 
tatgtactgt atgggaaatg itgt^aa^S tac^JS^ "^^^S^taag gcacaaaaac 360 
tgaatccaa Q'^aaatat taccttttcc acattttaaa cagacaactt 420 

429 

<210> 200 
<211> 279 
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<212> DNA 

<213> Homo sapiens 

<400> 200 

gcttttttga ggaattacag ggaagctcct ggaattgtac atggatatct ttatccctag 60 
ggggaaatca aggagctggg cacccctaat tctttatgga agtgtttaaa actattttaa 120 
ttttattaca agtattacta gagtagtggt tctactctaa gatttcaaaa gtgcatttaa 180 
aatcatacat gttcccgcct gcaaatatat tgttattttg gtggagaaaa aaatagtata 240 
ttctacataa aaaattaaag atattaacta agaaaaaaa 279 

<210> 201 
<211> 569 
<212> DNA 
<213> Homo sapiens 

<400> 201 

taggtcagta tttttagaaa ctcttaatag ctcatactct tgataccaaa agcagccctg 60 
attgttaaag cacacacctg cacaagaagc agtgatggtt gcatttacat ttcctgggtg 120 
cacaaaaaaa aattctcaaa aagcaaggac ttacgctttt tgcaaagcct ttgagaagtt 180 
actggatcat aggaagctta taacaagaat' ggaagattct, taaataactc actttctttg 240 
gtatccagta acagtagatg ttcaaaatat gtagctgatt aataccagca ttgtgaacgc 300 
tgtacaacct tgtggttatt actaagcaag ttactactag cttctgaaaa gtagcttcat 360 
aattaatgtt atttatacac tgccttccat gacttttact ttgccctaag ctaatctcca 420 
aaatctgaaa tgctactcca atatcagaaa aaaaggggga ggtggaatta tatttcctgt 480 
gattttaaga gtacagagaa tcatgcacat ctctgattag ttcatatatg tctagtgtgt 540 
aataaaagtc aaagatgaac tctcaaaaa 569 

<210> 202 

<211> 501 

<212> DNA 

<2ll3> Homo sapiens 

<400> 202 

attaataggc ttaataattg 
tagcatctgg cagtggggcc 
gagcaacatg attgagaacc 
tgtacctgtg tggtctaagc 
aattcttgac aatgaaatga 
atagcaccac ctatcagcac 
gtgactgaca ttatgaaggc 
tttcttggca ggctcgttgt 
tggcatattt tggaattctg 



<210> 


203 


<211> 


261 


<212> 


DNA 


<213> 


Homo sapiens 


<220> 




<221> 


misc feature 


<222> 


36, 96 


<223> 


n = A,T,C or 



<400> 203 

gacaagctcc tggtcttgag 
gataaaatga atgagttctg 
gttagctctt tgaatgttct 
tatcattgta taaaagctgt 



ttggcaagga tccttttgct 
aagaaaataa ggtttatgca 
agtgtatgtc aacaggtgca 

tggaatctgg tcaccttcca 
agctcaatgt gcatatggat 
tgaaaactct tttgcattaa 
ctgtactgaa gacagcaagc 
acctcttgga aaacctcaat 
c 



ttctttggca tgcaagctcc 60 
tgtatgatgg ttttcttctt 120 
tttgagataa ctttaaatga 180 
tccatgcaac aacttgttca 240 
tcaatcccac accatcgatc 300 
gggatcattg caagagcagc 360 
tgttagtaca gaccagatgc 420 
gcaagatagt gtttcagtgc 480 

501 



atgtcttctc gttaangaga tgggcctttt ggaggtaaag 60 
tcatgattca ctattntata acttgcatga cctttactgt 120 
tgaaatttta gactttcttt gtaaacaaat gatatgtcct 180 
tatgtgcaac agtgtggaga ttccttgtct gatttaataa 240 
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aatacttaaa cactgaaaaa a 

261 

<210> 204 
<211> 421 
<212> DNA 

<213> Homo sapiens 
<400> 204 

agcatctttt ctacaacgtt aaaattgcag aagtaactta tcati-;,^=»=. 
caacaataac aataaatcct aaatqtaaat caatt^^^^f tcattaaaaa acaacaacaa 60 
gcctgttttt tccctttttt cKc?gg^^^ JajSat^a^ ^^^''^f ^""^ aaggatatca 120 
gcctctttcc tcttctcatg StgaSc ccSJ^J^.! cttcttccca aatttctaca 180 
gcttgtgtgc ttggactcgg' ^llll^,, a^^L^^??? ^oZliT^^^ c1a\T^^'^ 1'' 
aactcaaacc ttcaagccct aggtgtaocc atttta+o^» ^^''^"^tta ctgttggaga 300 
act^c«= ,c=a„„., SJ^^J: 3eo 



<210> 205 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<400> 205 



"SgSa: a1— ~S ^-J^- ctctttagat 60 

-™ L-|2a -"I^JacI g-2?S 2 HI 
gaggaagatg ggJaSgaaJg gJcao;??t^ ^^afr^*"" ^f^^^^^gga ctctgggtca 240 
cagactttag aaaacLcJg gac^Saaat ^111X11 tacctttaaa 300 

gaatgagacc aaaggaaaag rtSaca^ac " atgacttgga cacatagact 360 

agagaatctt atg??tttt? Stgga'SL ^iTtTttll tttaaaagag 420 



<210> 206 
<211> 481 
<212> DNA 

<213> Homo sapiens 

<400> 206 



tgcg'Sa^':: gtgSa T^^^T ^^^^^^^^ ctgcgtgggc cgtctcctcc 60 
g?cccgcggg SjgJS? rtS^S otS^'"" tgagtgcctt ttgaacgctg 120 
cgcctgccct gggtStac SSa^cccI gaSStJ SacSc?^ f cgaggtcc 180 
cggccttccc atctgcctqc ccacccaa^^ ^!ir°^ tgtgctgctg tgtocggagg 240 
acctcccgcc otcagtcc?g caatatS ^^'^"^^^^c cggcgcaggg tcccaagccc 300 
cctggcctcc gcgcLgccc acccacanS ^^^ggcacgt cctgoacaca caatgcaagt 360 
ggtSgaccc cc?ggaggtg ccctcSccc frT^''^^ ccgccaactc tgttatttS 420 
^ y gg ggtg ccctcggccc accggggcta tttattgttt aatttatttg 480 



<210> 207 
<211> 605 
<212> DNA 
<213> Homo sapiens 

<400> 207 



tiziiiTci ?ccctSK iziTtii?^ s:stir ^° 

ctcactggat tctcacggta ggScS gatc^aatc ^^^^^^^ attgctttat 120 
ttttttgatc ctagggtgct ccttttott^ ttltll ^aagctccaa agttgtctac 180 
yaa get ccttttgttt tacagagcag ggtcacttga tttgctagct 240 
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ggtggcagaa ttggcaccat tacccaggtc 
tgtatcatga aatgatttga aatcattgta 
tttccttgtg ctttgataac aaagactcca 
aagggctaga ttgggatttg aagacaaaat 
aacattaatg aaagcaaaac attataaaag 
tcttgatgct tccaaatgac atctaccaga 
cataa 

<210> 208 
<211> 655 
<212> DNA 

<213> Homo sapiens 



tgactgacca ccagtcagag gcactttatt 300 
aagcagcgaa gtctgataat gaatgccagc 360 
aatattctgg agaacctgga taaaagtttg 420 
tgtaggaaat cttacatttt tgcaataaca 480 
taattttaat tcaccacata cttatcaatt 540 
tatggttttg tggacatctt tttctgttta 600 

605 



<400> 208 

ggcgttgttc tggattcccg tcgtaac±ta 
tgatgtcctg caaatgaagg aggaggatgt 
aggtggcacc aatcttgact tccagatgga 
catctatatc ataaatctca agaggacctg 
tgttgccatt gaaaaccctg ctgatgtcag 
ggctgtgctg aagtttgctg ctgccactgg 
tggaaccttc actaaccaga tccaggcagc 
tgaccccagg gctgaccacc agcctctcac 
tgcgctgtgt aacacagatt ctcctctgcg 
caagggagct cactcagtgg gtttgatgtg 
gcgtggcacc atttcccgtg aacacccatg 

<210> 209 
<211> 621 
<212> DNA 

<213> Homo sapiens 



aagggaaact ttcacaatgt ccggagccct 60 
ccttaagttc cttgcagcag gaacccactt 120 
acagtacatc tataaaagga aaagtgatgg 180 
ggagaagctt ctgctggcag ctcgtgcaat 240 
tgttatatcc tccaggaata ctggccagag 300 
agccactcca attgctggcc gcttcactcc 360 
cttccgggag ccacggcttc ttgtggttac 420 
ggaggcatct tatgttaacc tacctaccat 480 
ctatgtggac attgccatcc catgcaacaa 540 
gtggatgctg gctcgggaag ttctgcgcat 600 
ggaggtcatg cctgatctgt acttc 655 



<400> 209 

catttagaac atggttatca tccaagacta 
caaatccaca ttcctcttga gttctgcagc 
gccgtagaat cacatgatct gaggaccatt 
gagtcttcca taaagttttg catggagcaa 
tcagccctct aaaagcatag ggcttagcct 
tagttttgta aacactatag catctgttaa 
gccgtgactc tggactatat cagtttttgg 
ccacgtggac cagtctgaat gtctttcctt 
aagaaacaat ctaaacaagt ttctgttgca 
gtaggcttct atattgcatt taacttgttt 
ctattgatga ataaagaaat t 

<210> 210 
<211> 533 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 20, 21, 61 
<223> n = A,T,C or G 



ctctaccctg caacattgaa ctcccaagag 60 
ttctgtgtaa atagggcagc tgtcgtctat 120 
catggaagct gctaaatagc ctagtctggg 180 
acaaacagga ttaaactagg tttggttcct 240 
gcaggcttcc ttgggctttc tctgtgtgtg 300 
gatccagtgt ccatggaaac cttcccacat 360 
aaagcagggt tcctctgcct gctaacaagc 420 
tacacctatg tttttaaata gtcaaacttc 480 
tatgtgtttg tgaacttgta tttgtattta 540 
ttgtaactcc tgattcttcc ttttcggata 600 

621 



<400> 210 

cgqcttgggg agccggcggn ngagtccggg 
nggcccgcgg gcccagggtg gggatgcacc 
agaagaaact tgcagaggcc aagtataagg 



acgtggagac ccggggtccc ggcagccggg 60 
gccgcggggt gggagctggc gccatcgcca 120 
agcgagggac ggtcttggct gaggaccagc 180 
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aSSalS ll'll^lt ^^^^aggaa tttgccagca 240 

caaccattgg ^tggatc?? "ggStct^ gSaa'gSJ ^acatgtgtg 300 

tgggggactt ctattacgaa ctaggtrtcl aaattff^a^ Ij^l'^'^^^ atgctgggcg 360 
atcggaatgg aggtctgata act^SSaa Itrl.ll^t. agtgtgcctg gcgctgaagc 420 
gcaagttcgc ccaggat^c .^'^ZH ™- l^l^^lf^l^ a^T^^^''^ ^1 



<210> 211 
<211> 451 
<212> DNA 

<213> Homo sapiens 
<400> 211 



liTaZ^ltlT,^^^^^^^ g™S tlg^fS ^^^^^^^^^^ ^^^^^^^^a SO 
ggagcttcag caaggaagtg gSagcgo! altlllttt ^^^gacacct gctaccaagc 120 
tgcgcaaggc agctagc?t? JcggS^ Sa?c?ataa f ^^^^^^g cctgaggggc 180 
aagctgccct acccccagtg agccc?ctqa loacaaS^? gcgggatgcc gtgaagcgag 240 
agaaatccaa ggctatcat? gagSa?c ?cSt^«! f ^^aggag gagttagaga 300 
agtgcgtgca ggagctggcc tcfScJcct JgctcSca? 0?^"^!""" ^a^gcagtcc 360 
agtctacgct ggagcgcagt gccattgcto ctttgtacgg catggtgtcg 420 

451 

<210> 212 
<211> 471 
<212> DNA 

<213> Hcano sapiens 
<220> 

<221> raise feature 
<222> 54 

<223> n = A,T,C or 6 
<400> 212 

£2SaS« llZZtli lllT^Ta ll^'Tl'' "^5-"- ttanaatgga 60 
gcactggggt ggggg^ggf^ TgZlf.l Ic^Xttt cggagtagga 120 

gagatccagt gcagttgtga tttctatanf l^^ ^^t^ ggattccttg ttgttgtgtt 180 
ttggcttaaa tccLJ?^S lllcltllt^ tccqagcttg gttccaggaa ttttgtg^ga 240 
aacctgtctg acccggtcac ^Sltltt ^J^^^^""""^ gaacgtgaac tcagtagctg 300 
gggtgggaac tcacg?gggg Sc^^ggS gSa^SS ^'"^.^^^^ gtcggg^tgg 360 
tccatgggac tttccttc?? t?tS|S lllTt^H T'^^^'^^ J^J 

<210> 213 
<211> 511 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 27, 63, 337, 442 
<223> n - A,T,C or G 



<400> 213 



ctnccStI c2a?2?aa atLtt'""' ^^""^^gg gtcaaggacc ctctttatag 60 
actttatat? ?t1ccttt?; SaJaoS f f "^^^ tactaaaata ttttttatag 120 
atctcagccg tttccSgc? ttccTtTa Tj^llT ^gagttggtg taattaaai 180 
ctcttttaat cttaaagStc tacaMtM? glSa^^ ^^S^^? ^^a^ HI 
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taactcttcc cactgcatat ttccatcttg aattggnggt tctaaattct gaaactgtag 360 

ttgagataca gctatttaat atttctggga gatgtgcatc cctcttcttt gtggttgccc 420 

aaggttgttt tgcgtaactg anactccttg atatgcttca gagaatttag gcaaacactg 480 

gccatggccg tgggagtact gggagtaaaa t 511 

<210> 214 
<211> 521 
<212> DNA 

<213> Homo sapiens 

<400> 214 

agcattgcca aataatccct aattttccac taaaaatata atgaaatgat gttaagcttt 60 

ttgaaaagtt taggttaaac ctactgttgt tagattaatg tatttgttgc ttccctttat 120 

ctggaatgtg gcattagctt ttttatttta accctcttta attcttattc aattccatga 180 

cttaaggttg gagagctaaa cactgggatt tttggataac agactgacag ttttgcataa 240 

ttataatcgg cattgtacat agaaaggata tggctacctt ttgttaaatc tgcactttct 300 

aaatatcaaa aaagggaaat gaagtataaa tcaatttttg tataatctgt ttgaaacatg 360 

agttttattt gcttaatatt agggctttgc cccttttctg taagtctctt gggatcctgt 420 

gtagaagctg ttctcattaa acaccaaaca gttaagtcca ttctctggta ctagctacaa 480 

attcggtttc atattctact taacaattta aataaactga a 521 

<210> 215 

<211> 381 
<212> DNA 

<213> Homo sapiens 
<220> , 

<221> misc_feature 

<222> 17, 20, 60, 61, 365 

<223> n » A,T,C or G 



<400> 215 

gagcggagag cggaccngtn agagccctga 
ncatcacacc ccgggaggag ccgcagctgc 
ccatgagcag cgaggccgag acccagcagc 
tcagcgccgc cgacaccaag cccggcacta 
gcggcctcac atcggcggcg cctgccggcg 
tgggaacagt aaaatggttc aatgtaagga 
ccaangaaga tgtatttgta c 



gcagccccac cgccgccgcc ggcctagttn 60 
cgcagccggc cccagtcacc atcaccgcaa 120 
cgcccgccgc cccccccgcc gcccccgccc 180 
cgggcagcgg cgcagggagc ggtggcccgg 240 
gggacaagaa ggtcatcgca acgaaggttt 300 
acggatatgg tttcatcaac aggaatgaca 360 

381 



<210> 216 
<211> 425 
<212> DNA 

<213> Homo sapiens 



<400> 216 

ttactaacta ggtcattcaa ggaagtcaag 
gatggtgttg aaatgtccac cttcttaaat 
aacaggccaa tcctgaaggt actccctgtt 
gcataagagt cctatttgcc ccagttaatt 
ctggctctgt tagaactctg tccaaaaagt 
aattgacaat atatatgcat gtgtttaaac 
cataatagta tttattaaag aatcacaact 
tttag 



ttaacttaaa catgtcacct aaatgcactt 60 
ttttaagatg aacttagttc taaagaagat 120 
tgctgcagaa tgtcagatat tttggatgtt 180 
caacttttgt ctgcctgttt tgtggactgg 240 
gcatggaata taacttgtaa agcttcccac 300 
caaatccaga aagcttaaac aatagagctg 360 
gtaaacatga gaataactta aggattctag 420 

425 



<210> 217 
<211> 181 
<212> DNA 
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<213> Homo sapiens 
<400> 217 



S-S iT^^t l^iiTt " " 

.cttt^ctt «tc.^,,. .„„«^, -^tcat^t ot,..at«. 1.0 



<210> 218 
<211> 405 
<212> DNA 
<213> Homo sapiens 

<400> 218 



SSL^: ?Saiss sss - 

=1 i--^ 

acaaggcagg cctttccta'c Jgggg^t^S g^acSgcc ScS^S^ TT^*^' '''' 
ggcctgagtt ggcgttgtgg gca^gctact gS^tSa^o aStaJff ^ tggtaggaat 300 
attaatcttt tgtagtttgt attaLcttg iT^lUlH JaSa ^^^9— 360 

<210> 219 
<211> 216 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 207, 210 
<223> n = A,T,C or G 

<400> 219 

tT.ltttlll llltltZtt g^^lSS limitT ^^^^^""^ ^--^^-^ «o 

tcaattgtaa arttcttgtt aaaSottf ^off^'^^f cagattttct gttcaaatat 120 
aaaaataaaa aggaaaga'Jc ITc^lHi Saaaa'^" ^'^""^^^^ gggatattgc 180 



<210> 220 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<400> 220 



tliitTotil g\%Tt?gcS Toiii^^^i ''''r^"^ ^^^^^^"^^ ^° 

atttcatctt tgagggLac tgaSga^g gStJtgttt TtTtlllll ^^^^^^^ ^^O 
gcaccccaag gactcagaao atoattttaa ^^ZT? gtgttctgat ggagaaaaca 180 

gcatgtaata Itgttgag?? gcaSSaaa otSJ^^f^ cagatgtgtg caatattggt 240 
tgcattgaaa aggaaaaS? rtcS^aaa a^a^^^^f ttatcttagt tcttcattac 300 
gtaagtcttt gllaaaaaa^ ^"^^^^^^^ atgcctgaca gtttaattta aaactatggt 360 

380 

<210> 221 
<211> 398 
<212> DNA 
<213> Homo sapiens 

<400> 221 
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ggttagtaag ctgtcgactt tgtaaaaaag 
tgtatattta atgaatgaac atgtacaatt 
gtgagtctgc aagtgaattt cactgatgtt 
cccagccccg tttcctttta ttttggagct 
agtaaaatag aatcagcaaa tcactcttat 
gtttctgtgg gagcagtgta caccaactct 
ttgtatgttg aataaaattt tctataaaaa 

<210> 222 
<211> 301 
'<212> DNA 
<213> Homo sapiens 



ttaaaaatga aaaaaaaagg aaaaatgaat 60 
tgccactggg aggaggttcc tttttgttgg 120 
gatattcatt gtgtgtagtt ttatttcggt 180 
aatgccagct gcgtgtctag ttttgagtgc 240 
ttttcatcct tttccggtat tttttgggtt 300 
tcctgtatat tgcctttttg ctggaaaatg 360 
ttaaaaaa 398 



<220> 

<221> misc_feature 

<222> 49, 64 

<223> n = A,T,C or G 



<400> 222 

ttcgataatt gatctcatgg gctttccctg 
taanaacttg aaacttgtaa actgagatgt 
gtgaagattt caaaacctga gagcactttt 
gatgacttta ggatttgcat ttttcccttt 
ggagggaaat ctgtttattt tttcctacaa 
a 



gaggaaaggt tttttttgnt gtttattttt 60 
ctgtagcttt tttgcccatc tgtagtgtat 120 
tctttgttta gaattatgag aaaggcacta 180 
attgcctcat ttcttgtgac gccttgttgg 240 
ataaaaagct aagattctat atcgcaaaaa 300 

301 



<210> 223 
<211> 200 
<212> DNA 
<213> Homo sapiens 

<400> 223 

gtaagtgctt aggaagaaac tttgcaaaca 
attccttcac actgtaattt aatgtgtttt 
agatttctac aggagacagt ggttttattt 
gctggatgaa cttaaaaaaa 

<210> 224 
<211> 385 
<212> DNA 
<213> Homo sapiens 

<400> 224 

gaaaggtttg atccggactc aaagaaagca 
gctgtaactg caagacctgg acaagagatt 
tctccaacac cagcaagccc taaccagggc 
ccaccaaagg acagttctgc ccctggtgga 
tcatcaaatg tgttaccaag acatcttgga' 
aaacagagca cttaatgtta tttacagttt 
ggccattttc aggtggtaaa aaaaa 

<210> 225 
<211> 560 
<212> PRT 
<213> Homo sapiens 



tttaatgagg atacactgtt catttttaaa 60 
atattctttt gtagtaaaac aacataactc 120 
ggattgtctt ctgtaatagg tttcaataaa 180 

200 



aaggagtgtg agccgccatc tgctggagca 60 
cgtcagcgaa ctgcagctca aagaaacctt 120 
cptcctccac aagttccagt atctcctgga 180 
cccccagaaa ggactgttac tccagcccta 240 
tcccctgcta cttcagtgcc tggaatgggt 300 
atattgtttt ctctggttac caataaaacg 360 

385 



<400> 225 

Met Glu Cys Leu Tyr Tyr Phe Leu Gly Phe Leu Leu Leu Ala Ala Arg 
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Leu Pro Leu Asp Ala Ala Lys Arg Phe lis Asp val Leu Gly Glu 
Arg Pro Ser Ala Tyr Met Arg Glu hL Asn Gin Leu Asn Sy Trp Ser 
Ser ASP Glu Asn Asp Trp Asn gL Lys Leu Tyr Pro vll Trp Lys Arg 
Gly ASP Met Arg Trp Lys Asn Ser Trp Lys Gly gL Arg Val Gin Ala 
Val Leu Thr Ser Asp Ser Pro Ala Leu Val Gly Ser Asn He Thr IL 
Ala val Asn Leu He Phe Pro Arg Cys gL Lys Glu Asp Ala L'n Gly 
Asn lie val Tyr Glu Lys Asn Cys Arg Asn Glu Ala Gly Jeu Ser Ala 
ASP Pro Tyr Val Tyr Asn Trp Thr Ala Trp Ser Glu 1^ Ser Asp Gly 
Glu Asn Gly Thr Gly Gin Ser His His Asn Val pje Pro Asp Gly Lys 
Pro Phe Pro His His Pro Gly Trp Arg Arg Trp Asn Phe He Tyr Val 
Phe His Thr Leu Gly Gin Tyr Phe Gin lys Leu Gly Arg Cys Ser Val 
Arg val Ser Val Asn Thr Ala Asn Val Leu Gly Pro 111 Leu Met 

Glu val Thr Val Tyr Arg Arg His Gly Arg Ala Tyr Val Pro He Ala 

220 

Gin val Lys Asp Val Tyr Val Val Thr Asp Gin He Pro Val Phe Val 
Thr Met Phe Gin Lys Asn Asp Arg Asn Ser Ser Asp Glu Thr Phe Itu 

250 one 

Lys Asp Leu Pro He Met Phe Asp Val Leu He His Asp Pro Ser His 
Phe Leu Asn Tyr Ser Thr He Asn ^yr Lys Trp Ser Ph. GlJ Asp Asn 

Thr Gly Leu Phe Val Ser Thr Asn His Thr Val Asn h!s Thr Tyr Val 

295 ^00 

Leu Asn Gly Thr Phe Ser Leu Asn Leu Thr Val Lys Ala Ala Ala Pro 

315 

Gly Pro cys Pro Pro Pro Pro Pro Pro Pro Arg Pro Ser Lys Pr. Thr 
Pro ser Leu Gly Pro Ala Gly Asp Asn III Leu Glu Leu Ser Irg He 
pro ASP Glu Asn Cys Gin He Asn l^g Tyr Gly His Phe Gin Ala Thr 
lie Thr He Val Glu Gly He Leu Glu Val Asn He He Gin Met Thr 
ASP val Leu Met Pro Val Pro Trp Pro Glu Ser Ser' Leu He Asp Phe 
val val Thr Cys Gin Gly Ser He Pro Thr Gl'u Val Cys O^r He H^e 
ser ASP Pro Thr Cys Glu He Thr Gin Asn Thr Val Cys Ser til Val 
ASP val ASP Glu Met Cys Leu Leu Thr Val Arg Arg ^r Phe Asn Gly 
ser Gly Thr Tyr Cys Val Asn Leu Thr Leu Gly Asp Z Thr Ser Leu 
Ala Leu Thr Ser Thr Leu He Ser Val Pro Asp Ir'g Asp. Pro Ala Ser 
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465 470 
Pro Leu Arg Met Ala Asn Ser Ala 
485 

He Phe Val Thr Val He Ser Leu 

500 

Tyr Asn Pro He Glu Asn Ser Pro 
515 520 
Leu Ser Val Phe Leu Asn Arg Ala 

530 535 
Gin Glu Lys Asp Pro Leu Leu Lys 
545 550 



475 480 
Leu He Ser Val Gly Cys Leu Ala 

490 495 
Leu Val Tyr Lys Lys His Lys Glu 
505 510 
Gly Asn Val Val Arg Ser Lys Gly 
525 

Lys Ala Val Phe Phe Pro Gly Asn 

540 

Asn Gin Glu Phe Lys Gly Val Ser 
555 560 



<210> 226 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 226 

He Leu He Pro Ala Thr Trp Lys Ala 
1 5 



<210> 227 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 227 

Phe Leu Leu Asn Asp Asn Leu Thr Ala 
1 5 



<210> 228 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400>, 228 

Leu Leu Gly Asn Cys Leu Pro Thr Val 
1 5 



<210> 229 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 229 

Lys Leu Leu Gly Asn Cys Leu Pro Thr Val 
15 10 



<210> 230 

<211> 10 

<212> PRT 

<213> Homo sapiens 
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<400> 230 

Arg Leu Thr Gly Gly Leu Lys Phe Phe Val 
^ 10 

<210> 231 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 231 

Ser Leu Gin Ala Leu Lys Val Thr Val 
J- 5 

<210> 232 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 232 

Ala Gly Ma Asp Val He Lys Asn Asp Gly Xle Ty. Ser Arg Tyr Phe 
Phe Ser Phe Ala 15 
20 

<210> 233 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 233 

Phe Phe ser Phe Ala Ala Asn Gly A.g Tyr Ser Leu Lys Val His Val 

Asn His Ser Pro Ser '^^ 15 

20 

<210> 234 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 234 

Phe Leu Val Thr Trp Gin Ala Ser Gly Pro Pro Gl« He He Leu Phe 
Asp Pro Asp Gly "^^ 15 

20 

<210> 235 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 235 
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10 



15 



Pro Asn Ser Asp 
20 



<210> 236 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 236 

lie Gin Asp Asp Phe Asn Asn Ala lie Leu Val Asn Thr Ser liys Arg 

15 10 15 

Asn Pro Gin Gin 
20 



<210> 237 
<211> 21 
<212> PRT 

<213> Homo sapiens 

<400> 237 

Arg Asn Ser Leu Gin Ser Ala Val Ser Asn lie Ala Gin Ala Pro Leu 

15 10 15 

Phe He Pro Pro Asn 
20 



<210> 238 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 238 

Thr His Glu Ser His Arg He Tyr Val Ala He Arg Ala Met Asp Arg 

15 10 15 

Asn Ser Leu Gin 
20 



<210> 239 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 239 

Arg Asn Pro Gin Gin Ala Gly He Arg Glu He Phe Thr Phe Ser Pro 

15 10 15 

Gin He Ser Thr 
20 



<210> 240 
<211> 21 
<212> PRT 

<213> Homo sapiens 
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<400> 240 

Gly Gin Ala Thr Ser Tyr Glu He Arg Met Ser Lys Ser Leu Gin Asn 

He Gin Asp Asp Phe 
20 

<210> 241 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 241 

Glu Arg Lys Trp Gly Phe Ser Arg Val Ser Ser Gly Gly Ser Phe Ser 

Val Leu Gly Val 
20 



<210> 242 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 242 

Gly Ser His Ala Met Tyr Val Pro Gly Tyr Thr Ala Asn Gly Asn He 

Gin Met Asn Ala 
20 



<210> 243 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 243 

Val Asn His Ser Pro Ser He Ser Thr Pro Ala His Ser He Pro Gly 

Ser His Ala Met 
20 



<210> 244 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 244 

Ala val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 

O 1 A 

His Phe Pro His ^5 
20 



<210> 245 
<211> 20 
<212> PRT 
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<213> Homo sapiens 
<400> 245 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin /Via Leu Lys 
20 



<210> 246 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 246 

Asn Leu Thr Phe Arg Thr Ala Ser Leu Trp lie. Pro Gly Thr Ala Lys 

15 10 15 

Pro Gly His Trp 
20 



<210> 247 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 247 

Leu His Phe Pro His Pro Val Met 

1 5 
Phe Tyr Pro lie 
20 



He Tyr Ala Asn Val Lys Gin Gly 
10 15 



<210> 248 

<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 248 

Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

1 5 10 ' 15 

Gly Ala Asp Val 
20 



<210> 249 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 249 

Gly Phe Tyr Pro He Leu Asn Ala Thr Val Thr Ala Thr Val Glu Pro 

1 5 -10 . . 15 

Glu Thr Gly Asp 
20 



<210> 250 



wo 02/00174 



112 



PCT/USOl/21065 



<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 250 

Phe ASP P.O ASP Gly Tyr Tyr Thr Asn Asn Phe Xle Thr Asn 

Leu Thr Phe Arg 15 
20 

<210> 251 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 251 

I.e« Gin Ma Leu Lys Val Thr Val Thr Ser Arg Ala Ser Asn Ser Ala 

Val Pro Pro Ala 15 
20 

<210> 252 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 252 

Met Ala Ser Val Axg Val Ala Ala Tyr Phe Glu Asn Phe Leu Ala Ala 
Trp Arg Pro Val Lys Ala Ser Asp aiy Tyr Tyr Thr Leu Ma Val 
Pro Met Oly ASP Val Pro Met Asp ^ xle Ser Val Ala ISp Xle Oly 
i^a Ala Val Ser Ser Xle Phe Asn Ser Pro Glu clu Phe Leu Gly Lys 
Ma val Gly Leu Ser Ala Glu Ala Leu Thr Xle 11 oi„ Ty. Ala Asp 
val Leu ser Lys Ala Leu Gly Lys Olu Val IJ, .3p Ala Lys Xle 11 
Pro Glu Ala Phe Glu Lys Leu Gly Phe Pro Ala Ala Lys Glu II Ala 
Asn Met Cys Arg Phe Tyr Olu Met Lys Pro Asp Arg Asp HI Asn Leu 
Thr His om Leu Asn P.o Lys Val Lys Ser Phe Ser Sn' Phe Xle Ser 
Glu Asn Gin Gly Ala Phe Lys Gly Met "° 



150 



<210> 253 
<211> 462 
<212> DNA 

<213> Homo sapiens 
<400> 253 



atggccagtg tccgcgtggc ggcctacttt oaaaacttt,^ i-r.^ 

taccct gaaaactttc tcgcggcgtg gcggcccgtg 60 
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aaagcctctg atggagatta ctacaccttg 
ggtatctctg ttgctgatat tggagcagcc 
tttttaggca aggccgtggg gctcagtgca 
gttttgt^cca aggctttggg gaaagaagtc 
gagaagctgg gattccctgc agcaaaggaa 
aagccagacc gagatgtcaa tctcacccac 
cagtttatct cagagaacca gggagccttc 

<210> 254 

<211> 8031 

<212> DNA 

<213> Homo sapiens 

<400> 254 

tggcgaatgg gacgcgccct gtagcggcgc 
cagcgtgacc gctacacttg ccagcgccct 
ctttctcgcc acgttcgccg gctttccccg 
gttccgattt agtgctttac ggcacctcga 
acgtagtggg ccatcgccct gatagacggt 
ctttaatagt ggactcttgt tccaaactgg 
ttttgattta taagggattt tgccgatttc 
acaaaaattt aacgcgaatt ttaacaaaat 
tcggggaaat gtgcgcggaa cccctatttg 
tccgctcatg aattaattct tagaaaaact 
tcatatcagg attatcaata ccatattttt 
actcaccgag gcagttccat aggatggcaa 
gtccaacatc aatacaacct attaatttcc 
aatcaccatg agtgacgact gaatccggtg 
agacttgttc aacaggccag ccattacgct 
cgttattcat tcgtgattgc gcctgagcga 
aattacaaac aggaatcgaa tgcaaccggc 
tttcacctga atcaggatat tcttctaata 
tggtgagtaa ccatgcatca tcaggagtac 
taaattccgt cagccagttt agtctgacca 
ctttgccatg tttcagaaac aactctggcg 
tcgcacctga ttgcccgaca ttatcgcgag 
tgttggaatt taatcgcggc ctagagcaag 
cccttgtatt actgtttatg taagcagaca 
cgtgagtttt cgttccactg agcgtcagac 
gatccttttt ttctgcgcgt aatctgctgc 
gtggtttgtt tgccggatca agagctacca 
agagcgcaga taccaaatac tgtccttcta 
aactctgtag caccgcctac atacctcgct 
agtggcgata agtcgtgtct taccgggttg 
cagcggtcgg gctgaacggg gggttcgtgc 
accgaactga gatacctaca gcgtgagcta 
aaggcggaca ggtatccggt aagcggcagg 
ccagggggaa acgcctggta tctttatagt 
cgtcgatttt tgtgatgctc gtcagggggg 
gcctttttac ggttcctggc cttttgctgg 
tcccctgatt ctgtggataa ccgtattacc 
agccgaacga ccgagcgcag cgagtcagtg 
tattttctcc ttacgcatct gtgcggtatt 
caatctgctc tgatgccgca tagttaagcc 
ggtcatggct gcgccccgac acccgccaac 
gctcccggca tccgcttaca gacaagctgt 
gttttcaccg tcatcaccga aacgcgcgag 
gtgaagcgat tcacagatgt ctgcctgttc 



gctgtaccga tgggagatgt accaatggat 120 
gtctctagca tttttaattc tccagaggaa 180 
gaagcactaa caatacagca atatgctgat 240 
cgagatgcaa agattacccc ggaagctttc 300 
atagccaata tgtgtcgttt ctatgaaatg 360 
caactaaatc ccaaagtcaa aagcttcagc 420 
aagggcatgt ag 462 



attaagcgcg gcgggtgtgg tggttacgcg 60 
agcgcccgct cctttcgctt tcttcccttc 120 
tcaagctcta aatcgggggc .tccctttagg 180 
ccccaaaaaa cttgattagg gtgatggttc 240 
ttttcgccct ttgacgttgg agtccacgtt 300 
aacaacactc aaccctatct cggtctattc 360 
ggcctattgg ttaaaaaatg agctgattta 420 
attaacgttt acaatttcag gtggcacttt 480 
tttatttttc taaatacatt caaatatgta 540 
catcgagcat caaatgaaac tgcaatttat 600 
gaaaaagccg tttctgtaat gaaggagaaa 660 
gatcctggta tcggtctgcg attccgactc 720 
cctcgtcaaa aataaggtta tcaagtgaga 780 
agaatggcaa aagtttatgc atttctttcc 840 
cgtcatcaaa atcactcgca tcaaccaaac 900 
gacgaaatac gcgatcgctg ttaaaaggac 960 
gcaggaacac tgccagcgca tcaacaatat 1020 
cctggaatgc tgttttcccg gggatcgcag 1080 
ggataaaatg cttgatggtc ggaagaggca 1140 
tctcatctgt aacatcattg gcaacgctac 1200 
catcgggctt cccatacaat cgatagattg 1260 
cccatttata cccatataaa tcagcatcca 1320 
acgtttcccg ttgaatatgg ctcataacac 1380 
gttttattgt tcatgaccaa aatcccttaa 1440 
cccgtagaaa agatcaaagg atcttcttga 1500 
ttgcaaacaa aaaaaccacc gctaccagcg 1560 
actctttttc cgaaggtaac tggcttcagc 1620 
gtgtagccgt agttaggcca ccacttcaag 1680 
ctgctaatcc tgttaccagt ggctgctgcc 1740 
gactcaagac gatagttacc ggataaggcg 1800 
acacagccca gcttggagcg aacgacctac 1860 
tgagaaagcg ccacgcttcc cgaagggaga 1920 
gtcggaacag gagagcgcac gagggagctt 1980 
cctgtcgggt ttcgccacct ctgacttgag 2040 
cggagcctat ggaaaaacgc cagcaacgcg 2100 
ccttttgctc acatgttctt tcctgcgtta 2160 
gcctttgagt gagctgatac cgctcgccgc 2220 
agcgaggaag cggaagagcg cctgatgcgg 2280 
tcacaccgca tatatggtgc actctcagta 2340 
agtatacact ccgctatcgc tacgtgactg 2400 
acccgctgac gcgccctgac gggcttgtct 2460 
gaccgtctcc gggagctgca tgtgtcagag 2520 
gcagctgcgg taaagctcat cagcgtggtc 2580 
atccgcgtcc agctcgttga gtttctccag 2640 
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acgagagigg atgctcacga Sg^?t1c Jga?SSaf f?^"^'^^ taccgatgaa 2760 
ttgtgagggt aaacaaotgg cgg^SqJ? a?aaSaa«n '^^^^^'^^^^^^ tactggaacg 2820 
tcaatgccag cgcttcgt?a a?a2S agg?g???ca 0^^""^ tcactcaggg 2880 
tgcgatgcag atccoaaaoa aggtgttcca cagggtagcc agcagcatcc 2940 

cgaa'acLg? HtllH^,^ ccaSS? ???tSL^' ttccgcgttt ccagacttta 3000 
gcagtcgctt cacgt?cg?? cgcgtafcgg SJ?catt? tlXll^' ""^-^^ ^060 
ccgccagcct agccgggtcc tcaacgacag gjgcacga^c alTclll^t ^^^'^caacc 3120 
catgccggcg ataatggcct acttctcocr att^Z^tl ^^^cgcaccc- gtggggccgc 3180 
ggcttgagcg agggcSgca ZlfoZl t'JSJgSagJ SSgSS: 1^,'^'' I''' 
gctccagcga aagcggtcct caccoaaaat n^r^nrZ^ gacaggccga tcatcgtcgc 3300 
gagttgcatg ata'a^a^ga ca'gtStlal ?gc°ggcS?g SaSac' ^^^^^^^^ ^^eo 
ccggaaggag ctgactggqt toaaooctc? f atagtcatgc cccgcgccca 3420 

atgagtgagc taacttacK taa??acaS al J-t^ I'' ggtcgagatc ccggtgccta 3480 
cctgtcltic cagSgcS aaJgaaSgg lolT.lTal r''^^'""" ^^^-^^^^aaa 3540 
tgggcgccag gg?gg?tttt ctt?Sacca ^^^f ^^^^ gggagaggcg gtttgcgtat 3600 
ccgcctggc? ??gKagaS tgcagc^S: iTcltlall T^tSllT "^^^^^^ ^660 
aatcctgttt gatggtggtt aacoacaaaa ttlt^Tf Q^^tgcccc agcaggcgaa 3720 
atcccactac cgagftSc gc2?2?ac gctgtcttcg gtatcgtcgt 3780 

cgcccagcgc cat?tgatcg ?tggcaac?a qcatcaS^ ^t<=g9taatg gcgcgcattg 3840 
gcatttgcat ggtttottoa aaa^™^! S^atcgcagt gggaacgatg ccctcattca 3900 
tcggctgaat ??gSJc?a SjSac^ gtcgccttcc cgttccgcta 3960 

agacagaact taatgg|ccc 5?af2gca caa^ttao^ '^^^^'^S^ agacgcgccg 4020 
gctccacgcc cagtlgcgta llllc^Tl lalUltTt ^^^^^^ gcgaccagat 4080 
• ggtcagagac atcaagaS? llcgZlall cUTJT ^^^^'^^ftg atgggtgtot 4140 
catcctggtc atccagcgga la^lallTa lllTr-^^ t ggcagcttcc acagcaatgg 4200 
tgtgcaccgc cgcttLSg gStcglcg^ IglltZTc ScSff ^cgagaagat 4260 
tggcacccag ttqatcaaca ca^nJtt^^ cgcttcgttc taccatcgac accaccacgc 4320 
gggccagac? gglggtS 2a?ca2^! ^^^ccgcgac aatttgcgac ggcgcgtgL 4380 
ccacgcggtt qgqaatSS ^^o!^ f gcaacgactg tttgcccgcc agttgttgtg 4440 
tcgcagaaac SScSgcc SScacc! "^f"^""^" ttccactttt tcccgcg?t? 4500 
catactctgo gaS^cSat S^rttacta If^f^'^^^t^ ggtctgataa gagacaccgg 4560 
cttccgggig ctatc^Scc Sg^^ llTtTttall f'^f'^'^^ aattgactct 4620 
tctcgacgct ctcccttatq caactccCo tttlll^^ ccattcgatg gtgtccggga 4680 
ccgttgagca cogccgccq? 2aaallltt ^^^^^^aagc agcccagtag taggttgagg 4740 
ccggccacgg ggJc^gccac tlllllllt ^f^*^^ agatggcgcc caacagtccc 4800 
cgagcccga? StccJc^^c gJtgatScg gXilTatl cqSf'""^ -cgaagtgg 4860 
gcgccggtga tqccaaccaf> oai-«^-,r^ gcgatatagg cgccagcaac cgcacctgtg 4920 
aa?taa?acg actcac^SJ gqoSt^ ttlT^T^ tcgagatctc gatcccgcg? 4980 
ttttgtttaa ctt^^agSg gf^J^S JScaSa'? ^^^'^'f -tagaaataa 5040 
acagcttcaa gacaatggg? JtSggat? gSSSca .T.TT ^^^^^^^^^t 5100 
gaatcagaac ctcatctcaa aca?taSqa Eaat^^c^ attaatcctc aggtacctga 5160 
taatgctacc aagagaaqaq tltllttr-tl Itltl gaagcttcat tttaoctatt 5220 
gaaagctaat aatlTcllH aSSaa^a JqScata.' f tttaatac ctgccacatg 5280 
gactgactgg tatqggqcac attlt^^t f^t^l''^^^^ gaaaaggcaa atgtcatagt 5340 
aaaagagggi aaa?a?a?tc ^llllTall tTalltZtt T'^fT ^^^^^^^^^ ^400 
tggctacgga tcacgaggcc gagtSSrt ccS^^atoa ^^^^^^^ata acttaacagc 5460 
gttcgatgag tataaclltg acSacrtS ctStaS? a^^f ^ gttggggtgt 5520 
gacaaggtgt tcatctgaci tcacaSaS^ tttta^^^^ gggcaaaatc aaattaaagt 5580 
agaaaactgt attattaqta aar^^^f^ ttttgtgtgt gaaaaaggtc cttgccccca 5640 
ccaaaatgca actgcatSa tSaJJ^^^ agaaggatgc acctttatct acaatagcac 5700 
taatgcaSgt accL^Lcc ^SgS TalTctl^l lll^^'l'f "^^^^"^ "60 
aagtgcatgg gatgtaatca caqactctan tltn^lt^ aaccagatgt gcagcctcag 5820 
gactgagctt ccacctcc?S cSSSS^ Ij-t^^l cacagctttc ccatgaacgg 5880 
tttagtgctg gatgtqSa a^tnll^ gcttgtagag gctggtgaca aagtggtctg 5940 
agcc^cfgaa' t'tt^aStS ?^g2?S SS^Jt a'^^'^^ aactacaaca 6000 
tttcgacagc aaaggaqaaa tcaaao™ „!f accttcgtgg gcattgccag 6060 

ggagaga tcagagccca gctacaccaa attaacagca atgatgatcg 6120 
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aaagttgctg gtttcatatc tgcccaccac tgtatcagct aaaacagaca tcagcatttg 6180 
ttcagggctt aagaaaggat ttgaggtggt tgaaaaactg aatggaaaag cttatggctc 6240 
tgtgatgata ttagtgacca gcggagatga taagcttctt ggcaattgct tacccactgt 6300 
gctcagcagt ggttcaacaa ttcactccat tgccctgggt tcatctgcag ccccaaatct 6360 
ggaggaatta tcacgtctta caggaggttt aaagttcttt gttccagata tatcaaactc 6420 
caatagcatg attgatgctt tcagtagaat ttcctctgga actggagaca ttttccagca 6480 
acatattcag cttgaaagta caggtgaaaa tgtcaaacct caccatcaat tgaaaaacac 654 0 
agtgactgtg gataatactg tgggcaacga cactatgttt ctagttacgt ggcaggccag 6600 
tggtcctcct gagattatat tatttgatcc tgatggacga aaatactaca caaataattt 6660 
tatcaccaat ctaacttttc ggacagctag tctttggatt ccaggaacag c±aagcctgg 6720 
gcactggact tacaccctga acaataccca tcattctctg caagccctga aagtgacagt 6780 
gacctctcgc gcctccaact cagctgtgcc cccagccact gtggaagcct ttgtggaaag 6840 
agacagcctc cattttcctc atcctgtgat gatttatgcc aatgtgaaac agggatttta 6900 
tcccattctt aatgccactg tcactgccac agttgagcca gagactggag atcctgttac 6960 
gctgagactc cttgatgatg gagcaggtgc tgatgttata aaaaatgatg gaatttactc 7020 
gaggtatttt ttctcctttg ctgcaaatgg tagatatagc ttgaaagtgc atgtcaatca 7080 
ctctcccagc ataagcaccc cagcccactc tattccaggg agtcatgcta tgtatgtacc 7140 
aggttacaca gcaaacggta atattcagat gaatgctcca aggaaatcag taggcagaaa 7200 
tgaggaggag cgaaagtggg gctttagccg agtcagctca ggaggctcct tttcagtgct 7260 
gggagttcca gctggccccc accctgatgt gtttccacca tgcaaaatta ttgacctgga 7320 
agctgtaaaa gtagaagagg aattgaccct atcttggaca gcacctggag aagactttga 7380 
tcagggccag gctacaagct atgaaataag aatgagtaaa agtctacaga atatccaaga 7440 
tgactttaac aatgctattt tagtaaatac atcaaagcga aatcctcagc aagctggcat 7500 
cagggagata tttacgttct caccccaaat ttccacgaat ggacctgaac atcagccaaa 7560 
tggagaaaca catgaaagcc acagaattta tgttgcaata cgagcaatgg ataggaactc 7620 
cttacagtct gctgtatcta acattgccca ggcgcctctg tttattcccc ccaattctga 7680 
tcctgtacct gccagagatt atcttatatt gaaaggagtt ttaacagcaa tgggtttgat 7740 
aggaatcatt tgccttatta tagttgtgac acatcatact ttaagcagga aaaagagagc 7800 
agacaagaaa gagaatggaa caaaattatt ataatgaatt ctgcagatat ccatcacact 78*60 
ggcggccgct cgagcaccac caccaccacc actgagatcc ggctgctaac aaagcccgaa 7920 
aggaagctga gttggctgct gccaccgctg agcaataact agcataaccc cttggggcct 7980 
ctaaacgggt cttgaggggt tttttgctga aaggaggaac tatatccgga t 8031 



<210> 255 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 9, 67, 247, 275, 277, 397 

<223> n = A,T,C or G 

<400> 255 

gtggccagng actagaaggc gaggcgccgc' gggaccatgg cggcggcggc ggacgagcgg 60 

agtccanagg acggagaaga cgaggaagag gaggagcagt tggttctggt ggaattatca 120 

ggaattattg attcagactt cctctcaaaa tgtgaaaata aatgcaaggt tttgggcatt 180 

gacactgaga ggcccattct gcaagtggac agctgtgtct ttgctgggga gtatgaagac 240 

actctangga cctgtgttat atttgaagaa aatgntnaac atgctgatac agaaggcaat 300 

aataaaacag tgctaaaata taaatgccat acaatgaaga agctcagcat gacaagaact 360 

ctcctgacag agaagaagga aggagaagaa aacatangtg g 401 

<210> 256 
<211> 401 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> misc_feature 



<222> 1, 37, 51, 79, 96, 98, 103, 104, 107, 116 167 Iftl ifl^ 
194, 206, 276, 303, 307, 308, 310, 323, 332, 'JJi, ^'53, 37^ '''' 

<223> n « A,T,C or G 
<400> 256 

tggtggncct gggatgggga accgcggtgg cttccgngga ggtttcggca ntggcatcca 60 
gggccggggt cgcggccgng gacggggccg gggccnangc cgnnganctc gcS^nJcaa 120 
ggccgaggat aaggagtgga tgcccgtcac caacttgggc cgcttgncca KqaSa?aaa III 
nancaagccc ctgnaggaga tctatntctt cttccctgcc ccattaagga a??aSaat 240 
catttgattt cttcctgggg gcctctctca aggatnaggt ttttgaagat SSScaaS 30S 
canaaannan accccgttgc ccngtccatc tncacccaac ncttccaagg gcnat?S? 36? 
tttaggcctc attncngggg ggaaccttaa cccaatttgg - ^ gcnarttttg 360 



5 401 



<210> 257 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 382, 387 
<223> n = A,r,C or G 

<400> 257 



Ti-^lll aacacttcat aaaatgtaaa gggctataac aaatatgtta taaagtaatt 60 
ctctcagccc tgaggtatac agaatcattt gcctcagact gctgttggat t^aaafS^ i5n 
ttaaaatatc tgctaagtaa tttgctatgt cttctcccac LtltcaKa ^gcctgSJc 18? 

lo^W:f^c^ T^T^.t "^^^^^^^^ tgttatgagc tttggacatg Staaccgt 40 
gcctgttcag agtgtctaca gtaagagctg gacaaactct ggagggacac aotctttaL 3nn 
acagctcttt tggttgcttt ccacttttct gaaaggttca cagSacct? SSaataSa III 
gaaactccca gttaaagcct angctancaa ttttttttag t ctagataata 360 



<210> 258 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> "258 



ss? s-s^ fs??s =^ H£i 
si— ?^ -s- ~- i~ 

attcagagag attgcagaag catatgaaac actctcagat g ctgaagcaaa 360 



<210> 259 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 259 



attgggtttg gagggaggat gatgacagag gaatgccctt tggccatcac gattttaatt 60 
ctccagaata ttgtgggttt gatcatcaat gcagtcatgt tlggctgcat ?Sca^g:aa 120 
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acagctcagg ctcacagaag ggcagaaact ttgattttca gccgccatgc tgtgattgcc 180 
gtccgaaatg gcaagctgtg cttcatgttc cgagtgggtg acctgaggaa aagcatgatc 240 
attagtgcct ctgtgcgcat ccaggtggtc aagaaaacaa ctacacctga aggggaggtg 300 
gttcctattc accaactgga cattcctgtt gataacccaa tcgagagcaa taacattttt 360 
ctggtggccc ctttgatcat ctgccacgtg attgacaagc g 401 

<210> 260 
<211> 363 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc feature 

<222> 1, % 19, 41, 63, 73, 106, 111, 113, 116, 119, 156, 158, 
162, 187, 247, 288, 289, 290, 292, 298, 299, 300, 340 
<223> n = A,T,C or G 

<400> 260 

aggaganang gagggggana tgaataggga tggagaggga natagtggat gagcagggca 60 
canggagagg aancagaaag gagaggcaag acagggagac acacancaca nangangana 120 
caggtggggg ctggggtggg gcatggagag cctttnangt cncccaggcc accctgctct 180 
cgctggnctg ttgaaaccca ctccatggct tcctgccact gcagttgggc ccagggctgg 240 
cttattnctg gaatgcaagt ggctgtggct tggagcctcc cctctggnnn anggaaannn 300 
attgctccct tatctgcttg gaatatctga gtttttccan cccggaaata aaacacacac 360 
aca 363 

<210> 261 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 114, 152 
<223> n = A,T,C or G 



<400> 261 

cggctctccg ccgctctccc ggggtttcgg 
tcaccttccc ctgacctgag tagtcgccat 
cttccctgga tttgatgagc gggctgatgc 
gggcacagat gaggagagca tcctgactct 
ggaaatctct gcagctttta agactctgtt 
agaactaact ggaaaatttg aaaaattaat 
tgatgcttat gaactgaaac atgccttgaa 

<210> 2 62 

<211> 401 

<212> DNA' 

<213> Homo sapiens 



ggcacttggg tcccacagtc tggtcctgct 60 
ggcacaggtt ctcagaggca ctgngactga 120 
anaaactctt cggaaggcta tgaaaggctt 180 
gttgacatcc cgaagtaatg ctcagcgcca 240 
tggcagggat cttctggatg acctgaaatc 300 
tgtggctctg atgaaaccct ctcggcttta 360 
gggagctgga a 401 



<220> 

<221> misc_feature 

<222> 7, 26, 258, 305, 358, 373, 374, 378 
<223> n * A,T,C or G 



<400> 262 

agtctanaac atttctaata ttttgngctt tcatatatca aaggagatta tgtgaaacta 60 
tttttaaata ctgtaaagtg acatatagtt ataagatata tttctgtaca gtagagaaag 120 
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tcaactcaaa aattatgntg cataatttS ctctaaatca agcctagttg 240 

tccancttca atgagaaal? ZlltllTot Itl^ t aggttttggg actacttttt 300 
tttttttgct aa^naVLl ttT.?.T.t T.t^^H ^^^^^^^^ ^^^^^"^^ ^60 

<210> 263 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 

<222> 232, 290, 304, 326, 383 

<223> n « A,T,C or G 

<400> 263 

T£?oZ g%~ LXS llllir'' f — - 

gcggcggtgg cggctagggc ggcggcaaat aa™™ ^agaggcagc tacggcggcg 120 
cactgcggca ggcccaggjg ??gaSgggc cccggSc?? aacc^ff atgcggtgac 180 
ctttcctcaa ctctccatct tctcctacca acraf^ff agcccgtccc gncggacccg 240 
ctancccctt ccccgtccct tcccS^ccc l^lrtTr.^ ccgaggcggn ctcaggctcc 300 
cctcccacca tggc?ctgaa HZTclTc a^^gaattga ^^'^^^^^^^ ccgccacccg 360 

<210> 264 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 264 

tlll^lll JtlSgaaa" ^^^IT^l^ ^StcT^^ T''''''''^ ^'=^^^^-^^ 
actttggcca gcattgacc? SaaaaS!^ ^tgttctgga atactcacgt gagggaactt 120 
cttcaSttt ?caJSctc ctgL?cS? .aTtTe^Tt t'St^^"" -cttggctg 180 
ctaagaaaac aactctgtca aa?gctg?S ?ct^caa«i^ ^^^'^^^ccac agtgatagcc 240 
accacaacaa agagggaagt aaaSa?a2 ^ acacaacaaa aagacctgtc 300 

gggtgacctg a?a?Sca?c I'^^tllTt S^gLIS T^'"^^'^ "^^^^^^^^ ^ 

<210> 265 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miis cofeature 
<222> 59 

<223> n - A,T,C or G 
<400> 265 

StMgggg tlTt^il ITdllllTt iTolTttl ct?^^^^^^ « 
gttagaagtt tctagatctg gLaaacaca alZllllT '^"^ggggtg tgggattcaa 120 
ggaggctgag gcaggcgqat catafS! f ^'^^^^^^ cctgtaatcc cagcacttta 180 
gaaaccc?g? ctc?ac?Sa aSaSaaaa f'^^'^^^^ accgtcctgg ctaacacagt 240 

271 

<210> 266 
<211> 401 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 45 

<223> n = A, T,C or G 



<400> 266 

attcataaat ttagctgaaa gatactgatt caatttgtat acagngaata taaatgagac 60 
gacagcaaaa ttttcatgaa atgtaaaata tttttatagt ttgttcatac tatatgaggt 120 
tctattttaa atgactttct ggattttaaa aaatttcttt aaatacaatc atttttgtaa 180 
tatttatttt atgcttatga tctagataat tgcagaatat cattttatct gactctgtct 240 
tcataagaga gctgtggccg aattttgaac atctgttata gggagtgatc aaattagaag 300 
gcaatgtgga aaaacaattc tgggaaagat ttctttatat gaagtccctg ccactagcca 360 
gccatcctaa ttgatgaaag ttatctgttc acaggcctgc a 401 

<210> 267 
<211> 401 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 116, 247, 277, 296, 307, 313, 322, 323, 336, 342, 355, 365, 

377, 378, 397 

<223> n = A, T,C or G 



<400> 267 

gaagaggcat cacctgatcc cggagacctt 
tgtggagtcg gatcctcttc ggggtgagcc 
catgcagctg ttcccgcgag gcctgtttga 
ccaggtgtac agccttgtgc ctgacaggac 
agagcanggg gagacaaaat cgtccagctg 
tctttcnctt ganggactta cnngggaccc 
tgggncccga aaccccnnta tttgcccttg 

<210> 268 
<211> 223 
<212> DNA 

<213> Homo sapiens 



tggagttaag aggcggcgga agcgagggcc 60 
agggtcggcg cgcgcggctg tctcanaact 120 
ggacgcgctg ccgcccatcg tgctgaggag 180 
cgtggccgac cggcagctga aggagcttca 240 
ggcttcnact tggatgccca tggaanttat 300 
aagaanccct tncaaggggc ccttngtgga 360 
ggggggncca a 401 



<400> 268 

tcgccatgtt ggccaggctg gtcttgaact cctgacttta agtgatccac ccgcctcaac 60 
ctcccaaagt gctgggatta caggtgtgag ccaccgcgcc tggcctgata catactttta 120 
gaatcaagta gtcacgcact ttttctgttc atttttctaa aaagtaaata tacaaatgtt 180 
ttgttttttg ttttttttgt ttgtttgttt ctgttttttt ttt 223 



<210> 269 
<211> 401 

<212> DNA 

<213> Homo sapiens 



<400> 269 

actatgtaaa ccacattgta ctttttttta ctttggcaac aaatatttat acatacaaga 60 
tgctagttca tttgaatatt tctcccaact tatccaagga tctccagctc taacaaaatg 120 
gtttattttt atttaaatgt caatagttgt tttttaaaat ccaaatcaga ggtgcaggcc 180 
accagttaaa tgccgtctat caggttttgt gccttaagag actacagagt caaagctcat 240 
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ttttaaagga gtaggacaaa gttgtcacag gtttttotta fi-«^4-4-4.4.=4. 4. 

attacatgtt aatttccatt tatatcaggg attctattt^ !^ tgcccccaaa 300 

cattttgtct cattgttttc tttgaca?aa Sagga^ca f gtgaagttgc 360 

<210> 270 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 240, 382 

<223> n - A,T,C or G 

<400> 270 

■ IXtl t^™? Itllll-- -gaggctg 60 

XlllZt g-Xa1 5~ ~1 - 
agaagtttcc agtgataagt gSSSgJ? cJa™^ ^^f ^^^=^5 octgtagagn 240 
ttcccaaaat gag?gcttct gtgcgttaca lll^tt^tt 
tgttttttct tttcaattct anatgaacat gggalSaa^ ^'^^^^^-^^ gtgatgactt 360 



5 401 



<210> 271 
<2H> 329 
<212> DNA 
<213> Homo sapiens 

<400> 271 



lT.llgT^ gcaSSt'c Sc^^cS rT'^'' ^^^^^^^^'^^ ggcccaagtt 60 
agcccSca; acScccct Sc~ I'tTJTt cccctgctgg ctggtgcctg 120 
gaggcaagcc atggagtgag acSaaac rnlt If ^^g^g^cttc tgcggggcct 180 
acccccagcc cccacccgg? ggtt"??cct S?;t:taic Tc^Tl'^J ^cttttccca 240 
cttatggcat ctcattgagg acaaaaaaa ^"^^^'^^^ tgtgtatagt gccaccacag 300 

329 

<210> 272 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

18^, 185, 188, IM 202 III', HI' 

s>\": 

<221> misc_feature 

<222> 341,-342, 343, 345, 346, 351, 358, 360, 362, 363, 387, 390, 
<223> n = A,T,C or G 
<400> 272 

"SS IS^SI^ ?,'S=cS? »=t"cot„ .,.coco«o ogcttocc, 60 

~- — S ---- - 
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aagnnaangc gggntacacc tncntgcant ggnccacnct gcngggaact ntacacacct 240 
acgggatgtg gctgcgccan gagccaagag cntttctgga tgattcccca gcctcttgnn 300 
agggantcta caacattgct nnntaccttt ntccnncngc nnntnntgga ntacaggngn 360 
tnntaacact acatcttttt tactgcnccn tncttggtgg g 401 

<210> 273 
<211> 401 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 399 

<223> n = A,T,C or G 



<400> 273 

cagcaccatg aagatcaaga tcatcgcacc 
tggctccatc ctggcctcac tgtccacctt 
cgacgagtcg ggcccctcca tcgtccaccg 
gtagcatttg ctgcatgggt taattgagaa 
ctcatgctag cctcacgaaa ctggaataag 
tatctgatat cagcactgga ttgtagaact 
aactgttccc cttggtatta acgtgtcagg 

<210> 274 
<211> 401 
<212> DNA 

<213> Homo sapiens 



cccagagcgc aagtactcgg tgtggatcgg 60 
ccagcagatg tggattagca agcaggagta 120 
caaatgcttc taaacggact cagcagatgc 180 
tagaaatttg cccctggcaa atgcacacac 240 
ccttcgaaaa gaaattgtcc ttgaagcttg 300 
tgttgctgat tttgaccttg tattgaagtt 360 
gctgagtgnt c 401 



<400> 274 

ccacccacac ccaccgcgcc ctcgttcgcc 
cgccgcccag gccatcgcca ccctccgcag 
cctaccgcag gatgttcggc ggcccgggca 
acgtgactac gtccacccgc acctacagcc 
gcagcctcta cgcctcgtcc ccgggcggcg 
tgcggagcag cgtgcccggg gtgcggctcc 
acgccatcaa caccgagttc aagaacaccc 

<210> 275 
<211> 401 
<212> DNA 

<213> Homo sapiens 



tcttctccgg gagccagtcc gcgccaccgc 60 
ccatgtccac caggtccgtg tcctcgtcct 120 
ccgcgagccg gccgagctcc agccggagct 180 
tgggcagcgc gctgcgcccc agcaccagcc 240 
tgtatgccac gcgctcctct gccgtgcgcc 300 
tgcaggactc ggtggacttc tcgctggccg 360 
gcaccaacga g 401 



<400> 275 

ccacttccac cactttgtgg agcagtgcct 
ctggcctggg cctgggcttc gggagagcag 
gaagggactt acctcccaaa ggttctgcag 
agctcctggg tgtgtcagag gccagcctgg 
agggagaggg agaggggacc cgaggctgag 
gacacggcag tgatgctgcg gtctctcctc 
ctcctgaacc actctttctt caagcagatc 

<210> 276 
<211> 401 
<212> DNA 

<213> Homo sapiens 



tcagcgcaac ccggatgcca ggtatccctg 60 
agggtgctca ggagggtaag gccagggtgt 120 
gggaatctgg agctacacac aggagggatc 180 
ggagctctgg ccactgcttc ccatgagctg 240 
gcataagtgg caggatttcg ggaagctggg 300 
ccctttccct ccaggcccag tgccagcacc 360 
aagcgacgtg c 401 



<220> 
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<221> mis cofeature 
<222> 11 

<223> n = A,T,C or 
<400> 276 



s r ~ ~' ""^^^ -"S's; 
5 sF'F s« 

gtgattgcaa tcagcatggg atttggccat ttctatggca c acttgctttg 360 



<210> 277 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 227, 333 
<223> n = A,T,C or G 

<400> 277 



aactttggca acatatctca gcaaaaacta cagctatgtt attcatocca aaataaa^^.^ fin 

S'cScS^^ ^''T'^'^t -'^^^^^^ ^-tV^^^^^^^^ llTtTtT. 11 

?^cScanatn ntZ . ^^^^tcaagt cccgctcatt acaaattctt cttgccagtg 180 
llTgclTclt SaSttSt L'^^'^""' catcatgtgt tacgagnggc gctcaagga? 240 
y^tgcttctt gaaaattgct tagttgaaaa atggagagat cagcttaata aaaaatcrai- -^inn 

llXllTcl IX'Jtli: T^^'^^^^^ ganaaca?tt ca^Sg^ ZlllZTo 3 S 
cgggcgcacc agtcgtagta atccccccaa accaaaggga a 401 



<210> 278 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 322, 354 
<223> n = A,T,C or G 

<400> 278 



F'It ~" '-cs* >'°o 
H ~" ^ z 
^ = EiLi iiSi i-- ~ I" 



<210> 279 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc feature 



wo 02/00174 



123 



PCTAJSOl/21065 



<222> 30, 35r. 81, 88, 180, 212, 378, 384, 391 
<223> n = A,T,C or G 

<400> 279 

aaattattgc ctctgataca tacctaagtn aacanaacat taatacctaa gtaaacataa 60 
cattacttgg agggttgcag nttctaantg aaactgtatt tgaaactttt aagtatactt 120 
taggaaacaa gcatgaacgg cagtctagaa taccagaaac atctacttgg gtagcttggn 180 
gccattatcc tgtggaatct gatatgtctg gnagcatgtc attgatggga catgaagaca 240 
tctttggaaa tgatgagatt atttcctgtg ttaaaaaaaa aaaaaatctt aaattcctac 300 
aatgtgaaac tgaaactaat aattttgatc ctgatgtatg ggacagcgta tctgtaccag 360 
gctctaaata acaaaagnta gggngacaag nacatgttcc t 401 

<210> 280 
<211> 326 
<212> DNA 
<213> Homo sapiens 



<400> 280 

gaagtggaat tgtataattc aattcgataa 
gttttttttg ttgttttttt tttaagaact 
tttttgccca tctgtagtgt atgtgaagat 
tagaattatg agaaaggcac tagatgactt 
atttcttgtg acgccttgtt ggggagggaa 
ctaagattct atatcgcaaa aaaaaa 

<210> 281 
<211> 374 
<212> DNA 
<213> Homo sapiens 



ttgatctcat gggctttccc tggaggaaag 60 
tgaaacttgt aaactgagat gtctgtagct 120 
ttcaaaacct gagagcactt tttctttgtt 180 
taggatttgc atttttccct ttattgcctc 240 
atctgtttat tttttcctac aaataaaaag 300 

326 



<400> 281 

caacgcgttt gcaaatattc 
tcgagcaatg gcttcaggac 
atgaagactg gcttgtctca 
cgctccctgt tagtgccgta 
ctctgtggtc aaggttggtt 
gtgagcagtc agcaccagtt 
tctcctggcc ctgg 

<210> 282 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 26, 27, 51, 137, 180, 222 
<223> n = A,T,C or G 

<400> 282 

agtgtggtgg aattcccgca tcctanncgc cgactcacac aaggcagagt ngccatggag 60 
aaaattccag tgtcagcatt cttgctcctt gtggccctct cctacactct ggccagagat 120 
accacagtca aacctgnagc caaaaaggac acaaaggact ctcgacccaa actgccccan 180 
accctctcca gaggttgggg tgaccaactc atctggactc anacatatga agaagctcta 240 
tataaatcca agacaagcaa caaacccttg atgattattc atcacttgga tgagtgccca 300 
cacagtcaag ctttaaagaa agtgtttgct gaaaataaag aaatccagaa attggcagag 360 
cagtttgtcc tcctcaatct ggtttatgaa acaactgaca aaca 404 



ccctggtagc ctacttcctt 
atgggttctc ttctcctgtg 
gtgtttcaac ctcaccaggg 
tgacagcccc catcaaatga 
ggctgattgg tggaaagtag 
ctgcaccagc agcgcctccg 



acccccgaat attggtaaga 60 
atcattcaag tgctcactgc 120 
ctgtctcttg gtccacacct 180 
ccttggccaa gtcacggttt 240 
ggtggaccaa aggaggccac 300 
tcctagtggg tgttcctgtt 360 

374 
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<210> 283 
<211> 184 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 26 

<223> n = A,T,C or G 
<400> 283- 

iS SiH iiE i= ^ - 

aaaa ^gttatggaa aatgtcaacc tttgtaagaa aaccaaaata 180 

184 

<210> 284 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 147, 149 
<223> n = A,T,C or G 

<400> 284 

^T.l"Ht l^l^^t '^tllr»i l^^Ttl " 

=S EES iSi ilpH Sps i 
p^-s ~ ~ ~ ~ Si i 

421 

<210> 285 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 34, 188 

<223> n = A,T,C or G 

<400> 285 

ctgggtggta actctttatt tcattotcca aaana^a^o^^ ^ 

cactgtgcag gcttcagctt ccactcro^n gaanaaagat gggagtggga acagggtgga 60 
ctgcSggtg cacagccctg actccn^^^^ caggattcag gctatctggg accgcaggga 120 
ct?ttcanag ccttggagga gc^aatccat '^ff'^^^^ aggtgacggg actggaagcc 180 
gcaggggatg gataaacJgg KacaS aaT.rlt^^ tgagtgccac tctgcagttt 240 
ggtgaagccc cggcgctgag ctaaan^o-^ gcattcctca cagccaacag. tgtaggtctt 300 
yy gccc cggcgctgag ctaagctcag gctgttccag ggagccacga aactgcaggt 360 

361 

<210> 286 
<211> 336 
<212> DNA 
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<213> Homo sapiens 



<220> 

<221> inisc_feature 

<222> 40, 68, 75, 127, 262 

<223> n = A,T,'C or G 



<400> 286 

tttgagtggc agcgccttta tttgtggggg 

ggaggaanag ccganaaact gtgtgaccgg 

cttgcanatg cccattggca tcaccggtgc 

tcttgttcaa catagggtag gtggcagcca 

ggcgctccat tttgtgttcc angagcatgt 

tgaggatgtt ctcgatgcag ctgcgctggc 

<210> 287 
<211> 301 
<212> DNA 
<213> Homo sapiens 



ccttcaaggn agggtcgtgg ggggcagcgg 60 
ggcctcaggt ggtgggcatt gggggctcct 120 
agccattggt ggcagcgggt . accggtcctt 180 
cgggtccaac tcgcttgagg ctgggccctg 240 
ggttctgtgg cgggagcccc acgcaggccc 300 
ggaaaa 336 



<220> 

<221> misc_feature 

<222> 15, 33, 44, 53, 76, 83, 107, 117, 154, 166, 192, 194, 207, 

215, 241, 246 

<223> n = A,T,C or G 



<400> 287 

tgggtaccaa atttntttat 
ttggtacaac ttatanaaaa 
cagggaagtc accccacggc 
gtctcccagg gngngcttgt 
nttgcncaag ttggtcacgt 

g 

<210> 288 
<211> 358 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 39, 143, 226 
<223> n = A,T,C or G 



ttgaaggaat ggnacaaatc 
ggnaaaggaa accccaacat 
tatggggaaa ttancccgag 
caaaaanata ttccnccaag 
ggtcacccaa ttctttgatg 



aaanaactta agnggatgtt 60 
gcat genet g ccttggngae 120 
gcttancttt cattatcaet 180 
ccaaattcgg gcgctcccat 240 
gctttcacct gctcattcag 300 

301 



<400> 288 

aagtttttaa actttttatt tgcatattaa 

tttgaacaaa aaaaaaaatg gcactctgat 

gggccagctt ggttttactc tanatttcac 

ttcttcettc accaacatgc aagttctttc 

gggaaaggea ggcgcggcct tcgttgtcag 

tcatttaaac ttgatccaac ctctttgcat 

<210> 289 
<211> 462 
<212> DNA 
<213> Homo sapiens 



aaaaattgng cattccaata attaaaatca 60 
taaactgcat tacagcctgc aggacacctt 120 
tgtcgtccca ccccacttct tccaccccac 180 
cttccctgcc agccanatag atagacagat 240 
tagttctttg atgtgaaagg ggcagcacag 300 
cttacaaagt taaacagcta aaagaagt 358 



<220> 
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<221> inisc_feature 

<222> 87, 141, 182; 220, 269, 327 

<223> n = A,T,C or G 



<400> 289 



ggcatcagaa atgctgttta tttctctgct gctcccaagc tggctggcct ttacaaaaaa an 
gcagacaaca gatgcatagt tgggganaaa gggaggacag g??cclgg^ aglg^^gS 120 
ggctgaggga ggaagggtaa naggaaggaa ggccatcctg gatccccLa t?tSgt?S 180 
anat^ggac aaagggactc ccaagccccc aaatcatcan aaaacaccaa ggagSgSg 240 
gagcttgagc aggccccagg gagcctcana gccataccag ccactgtcta cttcccatcc 300 
tcctctccca ttccctgtct gcttcanacc acctcccagc taagcLcag ctSSccc 360 
ct™2f ^^^"^"^9 cttgacagtc acagtgcctg gaattccac? acSggct? 420 
ctcccagttg gattaggacg tcgccctgtt agcatgctgc cc Jgg 



<210> 290 
<211> 481 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 44, 57, 122, 158, 304, 325, 352, 405 
<223> n = A,T,C or G 

<400> 290 

ItlllltTtt ttV^^^'lt aagaaaaaag caataagcaa tggnggtaaa tctctanaac 60 
llltTnttt "^^955'=" cctcccccga gaatgtgaca ttttgatttc caaacatgcc 120 
anaagtgtat ggttcccaac tgtactaaag taggtganaa gctgaagtcc tcaagtgttc 180 
atcttccaac ttttcccagt ctgtggtctg tctttggatc agcaataatt gcctgaacag 240 
Tc^tltltr ttcgttgatt tttgtctgta gctctctgag ctcctctatg ?gcagcaat? 300 
tTotllllt ^9^^^°"°* ttaanaactg catctcctgt gtcaaaacca anaaLtgtt 360 
tgtctaaagc aacaggtaag ccctcttttg tttgatttgc cttancaact gcatcctgtg 420 
tcaggcgcto ctgaaccaaa atccgaattg ccttaagcat taccaggtaa ?catc2gac 480 
^ 481 

<210> 291 
<211> 381 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 79, 166, 187, 208, 219, 315 

<223> n = A,T,C or G 

<400> 291 

attaalaa^J ttgtttaatt ctaaatcaaa toactttcac aacagtgaaa 60 

l^llltr^ ggttaaggng tgocactgta catatcatca ttttctgact ggggtcagga 120 
aSf ^^^'^^^^f^g gtggcaggag gagggtggag gctaanaaca cagLaacac 180 
acaaaanaaa ggaaagctgc cttggcanaa ggatgaggng gtgagcttgc cgaaggatgg 240 
tgggaagggg gctccctgtt ggggccgagc caggagtccc aagtcagctc tcctgcctta 300 

:iinzfo r ^^^^^^ ---'-^ Sg2??s III 

^ 381 

<210> 292 
<211> 371 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> mis cofeature 

<222> 32, 55, 12, 151, 189, 292 . 

<223> n = A,T,C or G 



<400> 292 

gaaaaaataa tccgtttaat tgaaaaacct 
gaggctgagg anaccaaacc cctacatcac 
gcagcaggca aagacaattc ccaaaacctc 
taccaccanc acatttttcc tcagccagcc 
gatcgccttc tcgttgaaat taatcccaca 
ggggactcgg ttcttcgaca tggaagggat 
acagcactta a 



gnaggatact attccactcc cccanatgag 60 

ctcgtagcca cttctgatac tcttcacgag 120 

nacaaaagca attccaaggg ctgctgcagc 180 

cccaatcttc tccacacagc cctccttatg 240 

gcccacagta acattaatgc ancaggagtc 300 

tttctcccaa tctgtgtagt tagcagcccc 360 

371 



<210> 293 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 75, 196, 222 
<223> n = A,T,C or G 



<400> 293 

gatttaaaag aaaacacttt attgttcagc 
tccataattt attgngatgt tatcaacatc 
ctgttcatgt tttcttgaac acgtcttcaa 
tgaggtaacg aagcanaagt atttttaaac 
cctatatgct caatacatgc cgcgtgatcc 
tttggaaaac atctgttatg atgactttca 
c 



aattaaaagt tagccaaata tgtatttttc 60 
aagtaaaatg ctcattttca tcatttgctt 120 
ttttccttcc aaaatgctgc atgccacact 180 
atgacagcta anaacattca tctacagcaa 240 
tagtagtttt ttcacaacct tctacaagtt 300 
tacaccttca cctcaaaggc tttcttgcac 360 

361 



<210> 294 
<211> 391 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 26, 77, 96, 150, 203, 252, 254, 264, 276 
<223> n =» A,T,C or G 



<400> 294 

tattttaaag 

atattgactc 

tattttttat 

tacatatgcc 

agggcatgca 

atttgaagtt 

cgatgtaatt 



tttaattatg 
tgtatanacc 
tctgaaaatg 
caaaatggct 
ananaaaatc 
atttcagtca 
gaaattcccc 



attcanaaaa 
acagttattg 
atattaatan 
ganaataaat 
tcanaatacc 
tctttgtctt 
tttttatcaa 



aatcgagcga 
gggganaagg 
aaagtcccgt 
acaacaggaa 
caaagnggca 
tggctccatg 
t 



ataactttct 
gctggtaggt 
ttccagtctg 
atgcaaaagc 
acaaggaacg 
tttcaggatg 



ctgaaaaaat 60 
taaattatcc 120 
attataaaga 180 
tgtaaagcta 240 
tttggctgga 300 
cgtgtgaact 360 
391 



<210> 295 
<211> 343 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> ndsc_feature 
<222> 145, 174, 205, 232 
<223> n - A,T,C or G 



<400> 295 



EiS iiH Si£ ~ « ^= .'^0 
~ ~ ~" ~s - 

tntt^o^^f! ttattacact tttagtgagc taaaatcctt ttaacatagc ctgcaaataa 300 
tctttcacaa aagccaagcc tcatttacaa agggtttatt tct "^cggatga 300 



<210> 296 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 96, 98, 106, 185 
<223> n = A,T,C or G 

<400> 296 



SSH iSr" - 

241 

<210> 297 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 12, 130 

<223> n = A,T,C or G 

<400> 297 

o"rt22?l 'ZITS' -'"W., gccac=a.tt 60 

'^'F- ?~ -° 

calcccc?' "oScaS 2tJitT f'^'T'^ tcc.gtggct ggtaccccac 2(0 

ttcacgaaot ot™ a°a^XS HZtTclt III 

tggaaaagta caaaaagaca gccagaggtg t ^aaccagaca gaagcagctc 360 



<210> 298 
<211> 321 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 14, 30, 76, 116, 201, 288, 301 
<223> n « A,T,C or G 
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<400> 298 

caagccaaac tgtntccagc tttattaaan 
ggcaggacat gggcanacaa tcgttaacag 
atggactacc aaaaatcaaa aagccactat 
tgaacaggga aagtttaaag ngagggttga 
ttttcacaag ccgaccctga ctttcaggaa 
natccacaat ctaaaaatgg a 

<210> 299 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_feature 
<222> 104, 268, 347 
<223> n = A,T,C or G 



atactttcca taaacaatca tggtatttca 60 

tatacaacaa ctttcaaact cccttnttca 120 
aaaacccaat gaagtcttca tctgatgctc 180 
catttcacat ttagcatgtt gtttaacaac 24 0 
gtgaaatgaa aatggcanaa tttatctgaa 300 

321 



<400> 299 

tatcataaag agtgttgaag tttatttatt 
attggtaaaa aaataaaaca aaaagcattt 
agaagtatca tttttctttg tcaaattata 
tggaattttg tcggtcactt gcactggttg 
agttaaattt tttttgttgg gatttcanat 
caacgtccac accaaattct tgatcaggac 
taggtagtct caca'gccttg cgtgttcgat 

<210> 300 
<211> 188 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 48 

<223> n = A,T,C or G 



atagcaccat tgagacattt tgaaattgga 60 
gaattgtatt tggnggaaca gcaaaaaaag 120 
ctgtttccaa acattttgga aataaataac 180 
acaagattag aacaagagga acacatatgg 240 
agagtttggt ttataaaaag caaacagggc 300 
caccaatgtc atagggngca atatctacaa 360 
attcaaagac t 401 



<400> 300 

tgaatgcttt gtcatattaa gaaagttaaa gtgcaataat gtttgaariac aataagtggt 60 

ggtgtatctt gtttctaata agataaactt ttttgtcttt gctttatctt attagggagt 120 

tgtatgtcag tgtataaaac atactgtgtg gtataacagg cttaataaat tctttaaaag 180 
gaaaaaaa 188 

<210> 301 
<211> 291 
<212> DNA 
<213> Homo sapiens 



<400> 301 

aagattttgt tttattttat tatggctaga 
acactaaaga aatcctctgt gcttttcaat 
tggtgtgact tcaagagttc atgttaactt 
tgtattcttg aagagcctgg gccatgaaga 
tgatgttctg gcagtaagtg tttatctggc 



aagacactgt tatagccaaa atcggcaatg 60 
atgcaaatat atttcttcca agagttgccc 120 
cttttctgga aacttccttt tcttagttgt 180 
gcttgcctaa gttttgggca gtgaactcct 240 
ctgcaatgag cagcgagtcc a 291 



<210> 302 
<211> 341 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 25 

<223> n = A,T,C or G 
<400> 302 

tgatttttca taattttatt aaatnatcac tgggaaaact aatggttcgc gtatcacaca 60 
attacactac aatctgatag gagtggtaaa accagccaat ggaatccagg taaagtacaa 120 
aaacgccacc ttttattgtc ctgtcttatt tctcgggaag gagggttcta ctttacacat 180 
ttcatgagcc agcagtggac ttgagttaca atgtgtaggt tccttgtggt tatagctgca 240 
gaagaagcca tcaaattctt gaggacttga catctctcgg aaagaagcaa actagtggat 300 
cccccgggct gcaggaattc gatatcaagc ttatcgatac c 341 

<210> 303 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 15, 27, 92, 124, 127, 183, 198, 244, 320 
<223> n « A,T,C or G 

<400> 303 

tgcagacagt aaatnaattt tatttgngtt cacagaacat actaggcgat ctcgacagtc 60 

gctccgtgac agcccaccaa cccccaaccc tntacctcgc agccacccta aaggcgactt 120 

caanaanatg gaaggatctc acggatctca ttcctaatgg tccgccgaag tctcacacag 180 

tanacagacg gagttganat gctggaggat gcagtcacct cctaaactta cgacccacca 240 

ccanacttca tcccagccgg gacgtcctcc cccacccgag tcctccccat ttcttctcct 300 

actttgccgc agttccaggn gtcctgcttc caccagtccc acaaagctca ataaatacca 360 
^ 361 

<210> 304 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 23, 104, 192 
<223> n = A,T,C or G 

<400> 304 

ctctttacaa cagcctttat ttncggccct tgatcctgct cggatgctgg tggaggccct 60 
tagctccgcc cgccaggctc tgtgccgcct ccccgcaggc gcanattcat gaacacggtg 120 
ctcaggggct tgaggccgta ctcccccagc gggagctggt cctccagggg cttcccctcg 180 
aaggtcagcc anaacaggtc gtcctgcaca ccctccagcc cgctcacttg ctgcttcagg 240 
tgggccacgg tctgcgtcag ccgcacctcg taggtgctgc tgcggccctt gttattcctc 300 
^ 301 

<210> 305 
<211> 331 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> 3, 36, 60, 193, 223 

<223> n = A,T,C or G 



<400> 305 

ganaggctag taacatcagt tttattgggt 
ggggctggcc ctcacaggtt gttgagttcc 
tcgacgttct cctccttggc actggccaag 
aactttgcca canacctctc ggcaaactct 
tccaacagtt tgatctcctc ttcatattta 
gccatcaggg acttgagggc ctggtccatg 



tggggnggca accatagcct ggctgggggn 60 
agcagggtct ggtccaaggt ctggtgaatc 120 
gtctcttcta ggtcatcgat ggttttctcc 180 
gctcgggtct cancctcctt cagcttctcc 240 
tcttctttgg gggaatactc ctcctctgag 300 
g 331 



<210> 306 
<211> 457 
<212> DMA 
<213> Homo sapiens 

<400> 306 

aatatgtaaa ggtaataact tttattatat 
agcagtgcaa aatttaaagg actgttttgt 
aattatatgt atcaaatata taagtaaaaa 
cactttggga ggctgaggca ggtggatcac 
cgatttatag caattttata aatatataac 
aaagtgaatt tgggattttt gtacttggta 
tggatccccc gggctgcagg aattcgatat 
ggggcccggt acccaattcg ccctatagtg 

<210> 307 
<211> 491 
<212> DNA 

<213> Homo sapiens 



taaagacaat gcaaacgaaa aacagaattg 60 
tctcaaagtt gcaagtttca aagccaaaag 120 
aaagttagac tttcaagcct gtaatcccag 180 
taacattaaa aagacaacat tagattttgt 240 
tttgtcactt ggatcctgaa gcaaaataat 300 
aaaagtttaa caccctaaat tcacaactag 360 
caagcttatc gataccgtcg acctcgaggg 420 
agtcgta 457 



<400> 307 

gtgcttggac ggaacccggc gctcgttccc 
ccgtcacctc ttcaccgcac cctcggactg 
gcgcagccac cgccgccgcc gccgcctctc 
ctcgcaggtg cgccagaact accaccagga 
cctggagctc tacgcctcct acgtttacct 
tgtggctttg aagaactttg ccaaatactt 
tgctgagaaa ctgatgaagc tgcagaacca 
caagaaacca gactgtgatg actgggagag 
tttggaaaaa a 

<210> 308 
<211> 421 
<212> DNA 

<213> Homo sapiens 



caccccggcc ggccgcccat agccagccct 60 
ccccaaggcc cccgccgccg ctccagcgcc 120 
cttagtcgcc gccatgacga ccgcgtccac 180 
ctcagaggcc gccatcaacc gccagatcaa 240 
gtccatgtct tactactttg accgcgatga 300 
tcttcaccaa tctcatgagg agagggaaca 360 
acgaggtggc cgaatcttcc ttcaggatat 420 
cgggctgaat gcaatggagt gtgcattaca 480 

491 



<400> 308 

ctcagcgctt cttctttctt ggtttgatcc 
aggccctgga tgtgatggtg tccaccttcc 
tcaagctcaa caagtcagaa ctaaaggagc 
ggaaaaggac agatgaagct gctttccaga 
acaacgaggt ggacttccaa gagtactgtg 
acgaattctt tgaaggcttc ccagataagc 
gtggttgggg ggtctgccag ctggggccct 
c 



tgactgctgt catggcgtgc cctctggaga 60 
acaagtactc gggcaaagag ggtgacaagt 120 
tgctgacccg ggagctgccc agcttcttgg 180 
agctgatgag caacttggac agcaacaggg 240 
tcttcctgtc ctgcatcgcc atgatgtgta 300 
agcccaggaa gaaatgaaaa ctcctctgat 360 
ccctgtcgcc agtgggcact tttttttttc 420 

421 
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<210> 309 
<211> 321 
<212> DNA 
<213> Homo sapiens 



<400> 309 

accaaatggc ggatgacgcc ggtgcagcgg 
tggggaaccg cggtggcttc cgcggaggtt 
gccgtggacg gggccggggc cgaggccgcg 
agtggatgcc cgtcaccaag ttgggccgct 
aggagatcta tctcttctcc ctgcccatta 
gggcctctct caaggatgag g 

<210> 310 
<211> 381 
<212> DNA 
<213> Homo sapiens 



gggggcccgg gggccctggt ggccctggga 60 
tcggcagtgg catccggggc cggggtcgcg 120 
gagctcgcgg aggcaaggcc gaggataagg 180 
tggtcaagga catgaagatc aagtccctgg 240 
aggaatcaga gatcattgat ttcttcctgg 300 

321 



<400> 310 

ttaaccagcc atattggctc aataaatagc 
tcagtgccta tttttcctgg aaactcaatt 
ctgttgtcat tttcattcgg tgacattctc 
cacatttttc atttatagat gtttgcatcc 
ctcattggat ggcttttttt tttttcctcc 
taatgtatgt ttacatctct ttgcaaattc 
aatgtaacaa catactgtga a 

<210> 311 
<211> 538 
<212> DNA 
<213> Homo sapiens 



ttcggtaagg agttaatttc cttctagaaa 60 
ttaaatagtc caattccatc tgaagccaag 120 
tcccatgaca cccagaaggg gcagaagaac 180 
tttgtattaa aattattttg aaggggttgc 240 
agggagaagg ggagaaatgt acttggaaat 300 
ctgtacatag agatatattt tttaagtgtg 360 

381 



<400> 311 

tttgaattta caccaagaac ttctcaataa 
cataccacaa gagaagttaa tttcttaaca 
accaagttct gatatctttt aaagacatag 
tgaaaatatc cttgttgtgt attaggtttt 
gtcatcagta ccctcctatt cagctcccca 
ggttttcttc ttatttttag ataattcaag 
tttatggtaa actcttttaa agaaaattta 
ttaaatcttt atcatagact ctgtacatat 
atcatcggtg ggatgacaga acaaacatat 

<210> 312 
<211> 176 
<212> DNA 
<213> Homo sapiens 



aagaaaatca tgaatgctcc acaatttcaa 60 
ttgtgttcta tgattatttg taagaccttc 120 
ttcaaaattg cttttgaaaa tctgtattct 180 
taaataccag ctaaaggatt acctcactga 240 
agatgatgtg tttttgctta ccctaagaga 300 
tgcttagata aattatgttt tctttaagtg 360 
atatgttata gctgaatctt tttggtaact 420 
gttcaaatta gctgcttgcc tgatgtgtgt 480 
ttatgatcat gaataatgtg ctttgtaa 538 



<400> 312 

ggaggagcag ctgagagata gggtcagtga atgcggttca gcctgctacc tctcctgtct 60 
tcatagaacc attgccttag aattattgta tgacacgttt tttgttggtt aagctgtaag 120 
gttttgttct ttgtgaacat gggtattttg aggggagggt ggagggagta gggaag 176 

<210> 313 
<211> 396 
<212> DNA 
<213> Homo sapiens 
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<400> 313 

ccagcacccc caggccctgg gggacctggg 
tggcgctccc atggctcttg caacatctcc 
agccaccagc ccctcactgg gttcggagga 
gaaacatcgg atttggggaa cgcgtgtcaa 
actgttctgt tccttgtgta actgtgttgc 
gtcaccgggg caactgcctg ggggcgggga 
tataccaaag gtgctacatc tatgtgatgg 

<210> 314 
<211> 311 
<212> DNA 

<213> Homo sapiens 



ttctcagact gccaaagaag ccttgccatc 60 
ccttcgtttt tgagggggtc atgccggggg 120 
gagtcaggaa gggccaagca cgacaaagca 180 
tcccttgtgc cgcagggctg ggcgggagag 240 
tgaaagacta cctcgttctt gtcttgatgt 300 
tgggggcagg gtggaagcgg ctccccattt 360 
9tgggg 396 



<400> 314 

cctcaacatc ctcagagagg actggaagcc 
cctgcagtat ctcttcttgg agcccaaccc 
ggtcctgcag aacaaccggc ggctgtttga 
ctacatcggc tccacctact ttgagcgctg 
cgccacggcc acaagccctg gcatcccctg 
tttggggggc g 



agtccttacg ataaactcca taatttatgg 60 
cgaggaccca ctgaacaagg aggccgcaga 120 
gcagaacgtg cagcgctcca tgcggggtgg 180 
cctgaaatag ggttggcgca tacccacccc 240 
caaatattta ttgggggcca tgggtagggg 300 

311 



<210> 315 
<211> 336 
<212> DNA 

<213> Homo sapiens 



<400> 315 

tttagaacat ggttatcatc caagactact 
aatccacatt cctcttgagt tctgcagctt 
cgtagaatca catgatctga ggaccattca 
gtcttccata aagttttgca tggagcaaac 
agccctctaa aagcataggg cttagcctgc 
gttttgtaaa cactatagca tctgttaaga 



ctaccctgca acattgaact cccaagagca 60 
ctgtgtaaat agggcagctg tcgtctatgc 120 
tggaagctgc taaatagcct agtctgggga 180 
aaacaggatt aaactaggtt tggttccttc 240 
aggcttcctt gggctttctc tgtgtgtgta 300 
tccagt 336 



<210> 316 
<211> 436 
<212> DNA 
<213> Homo sapiens 



<400> 316 

aacatggtct gcgtgcctta agagagacgc 
atgtttccat tggaattgtt ggtaaagact 
tgtctccatt cctggaaggt cttgaagaaa 
ctgctgatga acctgcagaa aaggctgatg 
ctatatatgt attatcaaat atgtaagaat 
atactttgaa ccaaaagttg cagagtggtg 
gtgagttttt tccaagcaac ctcactgaaa 
agggtctgta taatca 



ttcctgcaga acaggacctg actacaaaga 60 
tggagtttac aatctatgat gatgatgatg 120 
gaccacagag aaaggcacag cctgctcaac 180 
aaccaatgga acattaagtg ataagccagt 240 
acaggcacca catactgatg acaataatct 300 
gaatgctatg ttttaggaat cagtccagat 360 
cctatataat ggaatacatt tttctttgaa 420 

436 



<210> 317 
<211> 196 
<212> DNA 

<213> Homo sapiens 



<400> 317 

tattccttgt gaagatgata tactattttt gttaagcgtg tctgtattta tgtgtgagga 60 



wo 02/00174 



134 



PCT/USOl/21065 



gctgctggct tgcagtgcgc gtgcacgtgg agagctggtg cccggagatt ggacggcctg 120 
atgctccctc ccctgccctg gtccagggaa gctggccgag ggtcctggct cctgaggggc 180 
atctgcccct ccccca 196 

<210> 318 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 9, 102, 122, 167, 182, 193, 235, 253, 265, 266, 290, 321, 
378 

<223> n = A,T,C or G 



<400> 318 

gacgcttnng ccgtaacgat gatcggagac atcctgctgt tcgggacgtt gctgatgaat 60 

gccggggcgg tgctgaactt taagctgaaa aagaaggaca cncagggctt tggggaggag 120 

tncagggagc ccaacacagg tgacaacatc cgggaattct tgctgancct cagatacttt 180 

cnaatcttca tcnccctgtg gaacatcttc atgatgttct gcatgattgt gctgntcggc 240 

tcttgaatcc cancgatgaa accannaact cactttcccg ggatgccgan tctccattcc 300 

tccattcctg atgacttcaa naatgttttt gaccaaaaaa ccgacaacct tcccagaaag 360 

tccaagctcg tggtgggngg a 381 

<210> 319 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<400> 319 

ctaagcttta cgaatggggt gacaacttat gataaaaact agagctagtg aattagccta 60 
tttgtaaata cctttgttat aattgatagg atacatcttg gacatggaat tgttaagcca 120 
cctctgagca gtgtatgtca ggacttgttc attaggttgg cagcagaggg gcagaaggaa 180 
ttatacaggt agagatgtat gcagatgtgt ccatatatgt ccatatttac attttgatag 240 
ccattgatgt atgcatctct tggctgtact ataagaacac attaattcaa tggaaataca 300 
ctttgctaat attttaatgg tatagatctg ctaatgaatt ctcttaaaaa catactgtat 360 
tctgttgctg tgtgtttcat tttaaattga gcattaaggg aatgcagcat ttaaatcaga 420 
actctgccaa tgcttttatc tagaggcgtg ttgccatttt tgtcttatat gaaatttctg 480 
tcccaagaaa ggcaggatta catctt 506 

<210> 320 
<211> 351 
<212> DNA 
<213> Homo sapiens 



<400> 320 

ctgacctgca ggacgaaacc atgaagagcc 
cggtagtaac tttgtgttat gaatcacatg 
tcattaacag gagaaatgca aataccttca 
tccaagagag gatccgagaa cgctctaagc 
atgactacag actttgcgaa cgctacgcca 
gctacttcag gaagcgccga gggaccaaat 

<210> 321 
<211> 421 
<212> DNA 
<213> Homo sapiens 



tgatccttct tgccatcctg gccgccttag 60 
aaagcatgga atcttatgaa cttaatccct 120 
tatcccctca gcagagatgg agagctaaag 180 
ctgtccacga gctcaatagg gaagcctgtg 240 
tggtttatgg atacaatgct gcctataatc 300 
gagactgagg gaagaaaaaa a 351 
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<400> 321 

ctcggaggcg ttcagctgct tcaagatgaa 
ccagaaactc attgaagtgg acgatgaacg 
ggccacagaa gttgctgctg ac.gctctggg 
cagtggtggg aacgacaaac aaggtttccc 
tgtccgcctg ctactgagta aggggcattc 
aaagagaaaa tcagttcgtg gttgcattgt 
tattgtaaaa aaaggagaga aggatattcc 
c 



gctgaacatc tccttcccag ccactggctg 60 
caaacttcgt actttctatg agaagcgtat 120 
tgaagaatgg aagggttatg tggtccgaat 180 
catgaagcag ggtgtcttga cccatggccg 240 
ctgttacaga ccaaggagaa ctggagaaag 300 
ggatgcaaat ctgagcgttc tcaacttggt 360 
tggactgact gatactacag tgcctcgccg 420 

421 



<210> 322 
<211> 521 
<212> DNA 

<213> Homo sapiens 



<400> 322 

agcagctctc ctgccacagc tcctcacccc 
tccactccct ccttggtcaa gagcacctca 
gtgctgaaac gaccggagat actgacagat 
ccccttacct cacttgtctc tagccgcagc 
gacacagcag ccaagttcat tggagctggg 
gctgggattg gaactgtgtt tgggagcctc 
aagcaacagc tcttctccta cgccattctg 
ttttgtctga tggtagcctt tctcatcctc 
ccatagttct cccgcgtctg gttggccccg 

<210> 323 
<211> 435 
<212> DNA 

<213> Homo sapiens 



ctgaaaatgt tcgcctgctc caagtttgtc 60 
cagctgctga gccgtccgct atctgcagtg 120 
gagagcctca gcagcttggc agtctcatgt 180 
ttccaaacca gcgccatttc aagggacatc 240 
gctgccacag ttggggtggc tggttctggg 300 
atcattggtt atgccaggaa cccttctctg 360 
ggctttgccc tctcggaggc catggggctc 420 
tttgccatgt gaaggagccg tctccacctc 480 
tgtgttcctt t 521 



<400> 323 

ccgaggtcgc acgcgtgaga cttctccgcc 
tcctacctgc tggctgccct agggggcaac 
atcttggaca gcgtgggtat cgaggcggac 
ctgaatggaa aaaacattga agacgtcatt 
cctgctggtg gggctgtagc cgtctctgct 
tctgcccctg ctgcagcaga ggagaagaaa 
gatgatgaca tgggatttgg cctttttgat 
ttttacacat ctcaa 

<210> 324 
<211> 521 
<212> DNA 
<213> Homo sapiens 



gcagacgccg ccgcgatgcg ctacgtcgcc 60 
tcctccccca gcgccaagga catcaagaag 120 
gacgaccggc tcaacaaggt tatcagtgag 180 
gcccagggta ttggcaagct tgccagtgta 240 
gccccaggct ctgcagcccc tgctgctggt 300 
gatgagaaga aggaggagtc tgaagagtca 360 
taaattcctg ctcccctgca aataaagcct 420 

435 



<400> 324 

aggagatcga ctttcggtgc ccgcaagacc 
tggtgcagta caagaatcgt caggccatcc 
agcacctggt ccagcagcag cccccctcgc 
aaccccagcc tcagcctcag ccgcaacccc 
aacccaagcc tcagccccag cagctccacc 
ctcatcctca ctcgcaccca caccctcacc 
cacacccaca gccgcactcg cagccgcacg 
ctgcctgaaa ggggcagctc ccgggcaaga 
gagcacattt ctattgtctt cacttggatc 



agggctggaa cgccgagatc acgctgcaga 60 
tggcggtcaa atccacgcgg cagaagcagc 120 
agccgcagcc gcagccgcag ctccagcccc 180 
agccccaatc acaaccccag cctcagcccc 240 
cgtatccgca tccacatcca catccacact 300 
cgcacccgca tccgcaccaa ataccgcacc 360 
ggcaccggct tctccgcagc acctccaact 420 
caaggttttg aggacttgag gaagtgggac 480 
aaaagcaaaa c 521 



<210> 325 
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<211> 451 
<212> DNA 
<213> Homo sapiens 



<400> 325 

attttcattt ccattaacct ggaagctttc 
acattattta aacagaaaaa gatgggctct 
tatttttact tagattactt tgggaatgag 
agtgaatgtg tctgtagttg tgttagtttg 
atccaaataa gaggcatata cattgaattg 
acccccaccc ccacccaaga cattttaata 
acatgaggta gtgttccact ggcaggatga 
ctttatcact tgaattatta acttaatttg 

<210> 326 
<211> 421 
<212> DNA 
<213> Homo sapiens 



atgaatattc tcttctttta aaacatttta 60 
ttctggttag ttgttacatg atagcagaga 120 
agattgttgt cttgaactct ggcactgtac 180 
cattaagcat gtataacatt caagtatgtc 240 
tttttaatcc tctgacaagt tgactcttcg 300 
gtaaatagag agagagagaa gagttaatga 360 
cttttcaata gctcaaatca atttcagtgc 420 
a 451 



<220> 

<221> misc_feature 
<222> 296 

<223> n = A,T,C or G 



<400> 326 

cgcggtcgta agggctgagg atttttggtc 
tcgctctcgc cgaggaacaa gtcggtcagg 
ggataccgga aaaacacccg tggagccgga 
aacaagccgc aacgtaaaat ccttggaaaa 
agaaaagaat ctcaaagtga aaggaccagt 
tacaagaaaa actccttgtg gtgaaggttc 
tcacaagcga ctcattgact tgcacagtcc 
c 

<210> 327 
<211> 456 
<212> DNA 
<213> Homo sapiens 



cgcacgctcc tgctcctgac tcaccgctgt 60 
aagcccgcgc gcaacagcca tggcttttaa 120 
ggtggcaatt caccgaattc gaatcaccct 180 
ggtgtgtgct gacttgataa gaggcgcaaa 240 
tcgaatgcct accaagactt tgagantcac 300 
taagacgtgg gatcgtttcc agatgagaat 360 
ttctgagatt gttaagcaga ttacttccat 420 

421 



<400> 327 

atcttgacga ggctgcggtg tctgctgcta 
cgacaagaag aagaaggacg ctggaaagtc 
atccgggggc aaggccaaaa agaagaagtg 
taacttagtc ttgtttgaca aagctaccta 
taaacttata accccagctg tggtctctga 
ggcagccctt caggagctcc ttagtaaagg 
tcaagtaatt tacaccagaa ataccaaggg 
atgaataggt ccaaccagct gtacatttgg 

<210> 328 
<211> 471 
<212> DNA 
<213> Homo sapiens 



ttctccgagc ttcgcaatgc cgcctaagga 60 
ggccaagaaa gacaaagacc cagtgaacaa 120 
gtccaaaggc aaagttcggg acaagctcaa 180 
tgataaactc tgtaaggaag ttcccaacta 240 
gagactgaag attcgaggct ccctggccag 300 
acttatcaaa ctggtttcaa agcacagagc 360 
tggagatgct ccagctgctg gtgaagatgc 420 
aaaaat 456 



<400> 328 

gtggaagtga catcgtcttt aaaccctgcg 
cagggaagac agggcgacct ggaagtccaa 
tgattatccg aaatgtttca ttgtgggagc 



tggcaatccc tgacgcaccg ccgtgatgcc 60 
ctacttcctt aagatcatcc aactattgga 120 
agacaatgtg ggctccaagc agatgcagca 180 
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gatccgcatg tcccttcgcg ggaaggctgt 
caaggccatc cgagggcacc tggaaaacaa 
ccgggggaat gtgggctttg tgttcaccaa 
gctggccaat aaggtgccag ctgctgcccg 
tgtgccagcc cagaacactg gtctcgggcc 

<210> 329 
<211> 278 
<212> DNA 

<213> Homo sapiens 



ggtgctgatg ggcaagaaca ccatgatgcg 240 
cccagctctg gagaaactgc tgcctcatat 300 
ggaggacctc actgagatca gggacatgtt 360 
tgctggtgcc attgccccat gtgaagtcac 420 
cgagaagacc tcctttttcc a 471 



<220> 

<221> misc_feature 
<222> 154, 204 
<223> n = A,T,C or G 



<400> 329 

gtttaaactt aagcttggta ccgagctcgg 

aaattgagat gcccccccag gccagcaaat 

ccttgatatt tttctttttt tttttttttt 

aggtgctatt taacatggga gganagcgtg 

tccaccctct ctccacctgc ctctggcttc 



atccactagt ccagtgtggt ggaattctag 60 
gttccttttt gttcaaagtc tatttttatt 120 
ttgnggatgg ggacttgtga atttttctaa 180 
tgcggctcca gcccagcccg ctgctcactt 240 
tcaggcct 278 



<210> 330 
<211> 338 
<212> DNA 
<213> Homo sapiens 



<400> 330 

ctcaggcttc aacatcgaat acgccgcagg 
cacaaacatt attataataa acaccctcac 
cgcactctcc cctgaactct acacaacata 
cctgttctta tgaattcgaa cagcataccc 
cctatgaaaa aacttcctac cactcaccct 
cattacaatc tccagcattc cccctcaaac 



ccccttcgcc ctattcttca tagccgaata 60 
cactacaatc ttcctaggaa caacatatga 120 
ttttgtcacc aagaccctac ttctaacctc 180 
ccgattccgc tacgaccaac tcatacacct 240 
agcattactt atatgatatg tctccatacc 300 
ctaaaaaa 338 



<210> 331 
<211> 2820 
<212> DNA 

<213> Homo sapiens 



<400> 331 

tggcaaaatc ctggagccag aagaaaggac 
gttgtacctg gaaaacaatg cccagactca 
gctcctgaac agcatggacc agcagattcg 
cacagaccac gcgcagaaca gcgtcacggc 
cttcgatgct ctctctccat cacccgccat 
cagttccgac gtgtccttcc agcagtcgag 
cactgaactg aagaaactct actgccaaat 
gatgacccca cctcctcagg gagctgttat 
gcacgtcacg gaggtggtga agcggtgccc 
gggacagatt gcccctccta gtcatttgat 
tgtagaagat cccatcacag gaagacagag 
tggcactgaa ttcacgacag tcttgtacaa 
gatgaaccgc cgtccaattt taatcattgt 
gggccgacgc tgctttgagg cccggatctg 
tgaagatagc atcagaaagc agcaagtttc 
gcgcccgttt cgtcagaaca cacatggtat 



agcagcattg atcaatctta cagctaacat 60 
atttagtgag ccacagtaca cgaacctggg 120 
gaacggctcc tcgtccacca gtccctataa 180 
gccctcgccc tacgcacagc ccagccccac 240 
cccctccaac accgactacc caggcccgca 300 
caccgccaag tcggccacct ggacgtattc 360 
tgcaaagaca tgccccatcc agatcaaggt 420 
ccgcgccatg cctgtctaca aaaaagctga 480 
caaccatgag ctgagccgtg agttcaacga 540 
tcgagtagag gggaacagcc atgcccagta 600 
tgtgctggta ccttatgagc caccccaggt 660 
tttcatgtgt aacagcagtt gtgttggagg 720 
tactctggaa accagagatg ggcaagtcct 780 
tgcttgccca ggaagagaca ggaaggcgga 840 
ggacagtaca aagaacggtg atggtacgaa 900 
ccagatgaca tccatcaaga aacgaagatc 960 
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cccagatgat gaactgttat acttaccagt gaggggccgt gagacttatg aaatgctgtt 1020 
gaagatcaaa gagtccctgg aactcatgca gtaccttcct cagcacacaa ttgaaacgta 1080 
caggcaacag caacagcagc agcaccagca cttacttcag aaacagacct caatacagtc 1140 
tccatcttca tatggtaaca gctccccacc tctgaacaaa atgaacagca tgaacaagct 1200 
gccttctgtg agccagctta tcaaccctca gcagcgcaac gccctcactc ctacaaccat 1260 
tcctgatggc atgggagcca acattcccat gatgggcacc cacatgccaa tggctggaga 1320 
catgaatgga ctcagcccca cccaggcact ccctccccca ctctccatgc catccacctc 1380 
ccactgcaca cccccacctc cgtatcccac agattgcagc attgtcagtt tcttagcgag 1440 
gttgggctgt tcatcatgtc tggactattt cacgacccag gggctgacca ccatctatca 1500 
gattgagcat tactccatgg atgatctggc aagtctgaaa atccctgagc aatttcgaca 1560 
tgcgatctgg aagggcatcc tggaccaccg gcagctccac gaattctcct ccccttctca 1620 
tctcctgcgg accccaagca gtgcctctac agtcagtgtg ggctccagtg agacccgggg 1680 
tgagcgtgtt attgatgctg tgcgattcac cctccgccag accatctctt tcccaccccg 1740 
agatgagtgg aatgacttca actttgacat ggatgctcgc cgcaataagc aacagcgcat 1800 
caaagaggag ggggagtgag cctcaccatg tgagctcttc ctatccctct cctaactgcc 1860 
agccccctaa aagcactcct gcttaatctt caaagccttc tccctagctc ctccccttcc 1920 
tcttgtctga tttcttaggg gaaggagaag taagaggcta cctcttacct aacatctgac 1980 
ctggcatcta attctgattc tggctttaag ccttcaaaac tatagcttgc agaactgtag 2040 
ctgccatggc taggtagaag tgagcaaaaa agagttgggt gtctccttaa gctgcagaga 2100 
tttctcattg acttttataa agcatgttca cccttatagt ctaagactat atatataaat 2160 
gtataaatat acagtataga tttttgggtg gggggcattg agtattgttt aaaatgtaat 2220 
ttaaatgaaa gaaaattgag ttgcacttat tgaccatttt ttaatttact tgttttggat 2280 
ggcttgtcta tactccttcc cttaaggggt atcatgtatg gtgataggta tctagagctt 2340 
aatgctacat gtgagtgcga tgatgtacag attctttcag ttctttggat tctaaataca 2400 
tgccacatca aacctttgag tagatccatt tccattgctt attatgtagg taagactgta 2460 
gatatgtatt cttttctcag tgttggtata ttttatatta ctgacatttc ttctagtgat 2520 
gatggttcac gttggggtga tttaatccag ttataagaag aagttcatgt ccaaacggtc 2580 
ctctttagtt tttggttggg aatgaggaaa attcttaaaa ggcccatagc agccagttca 2640 
aaaacacccg acgtcatgta tttgagcata tcagtaaccc ccttaaattt aatacccaga 2700 
taccttatct tacaatgttg attgggaaaa catttgctgc ccattacaga ggtattaaaa 2760 
ctaaatttca ctactagatt gactaactca aatacacatt tgctactgtt gtaagaattc 2820 



<210> 332 

<211> 2270 

<212> DNA 

<213> Homo sapiens 

<400> 332 

tcgttgatat caaagacagt tgaaggaaat gaattttgaa acttcacggt gtgccaccct 60 
acagtactgc cctgaccctt acatccagcg tttcgtagaa acccagctca tttctcttgg 120 
aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 
ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 
attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 
agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 
acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 420 
agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcgcc ctacgcacag 480 
cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 
ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 
tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 
cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
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gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacagacc 1380 
tcaatacagt ctccatcttc atatggtaac agctccccac ctctgaacaa aatgaacagc 1440 
atgaacaagc tgccttctgt gagccagctt atcaaccctc agcagcgcaa cgccctcact 1500 
cctacaacca ttcctgatgg catgggagcc aacattccca tgatgggcac ccacatgcca 1560 
atggctggag acatgaatgg actcagcccc acccaggcac tccctccccc actctccatg 1620 
ccatccacct cccactgcac acccccacct ccgtatccaa cagattgcag cattgtcggt 1680 
ttcttagcga ggttgggctg ttcatcatgt ctggactatt tcacgaccca ggggctgacc 1740 
accatctatc agattgagca ttactccatg gatgatctgg caagtctgaa aatccctgag 1800 
caatttcgac atgcgatctg gaagggcatc ctggaccacc ggcagctcca cgaattctcc 1860 
tccccttctc atctcctgcg gaccccaagc agtgcctcta cagtcagtgt gggctccagt 1920 
gagacccggg gtgagcgtgt tattgatgct gtgcgattca ccctccgcca gaccatctct 1980 
ttcccacccc gagatgagtg gaatgacttc aactttgaca tggatgctcg ccgcaataag 2040 
caacagcgca tcaaagagga gggggagtga gcctcaccat gtgagctctt cctatccctc 2100 
tcctaactgc cagcccccta aaagcactcc tgcttaatct tcaaagcctt ctccctagct 2160 
cctccccttc ctcttgtctg atttcttagg ggaaggagaa gtaagaggct acctcttacc 2220 
taacatctga cctggcatct aattctgatt ctggctttaa gccttcaaaa 2270 

<210> 333 

<211> 2816 

<212> DNA 

<213> Homo sapiens 

<400> 333 

tcgttgatat caaagacagt tgaaggaaat gaattttgaa acttcacggt gtgccaccct 60 
acagtactgc cctgaccctt acatccagcg tttcgtagaa acccagctca tttctcttgg 120 
aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 
ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 
attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 
agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 
acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 420 
agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcgcc ctacgcacag 480 
cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 
ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 
tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 
cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacatctc 1380 
ctttcagcct gcttcaggaa tgagcttgtg gagccccgga gagaaactcc aaaacaatct 1440 
gacgtcttct ttagacattc caagccccca aaccgatcag tgtacccata gagccctatc 1500 
tctatatttt aagtgtgtgt gttgtatttc catgtgtata tgtgagtgtg tgtgtgtgta 1560 
tgtgtgtgcg tgtgtatcta gccctcataa acaggacttg aagacacttt ggctcagaga 1620 
cccaactgct caaaggcaca aagccactag tgagagaatc ttttgaaggg actcaaacct 1680 
ttacaagaaa ggatgttttc tgcagatttt gtatccttag accggccatt ggtgggtgag 1740 
gaaccactgt gtttgtctgt gagctttctg ttgtttcctg ggagggaggg gtcaggtggg 1800 
gaaaggggca ttaagatgtt tattggaacc cttttctgtc ttcttctgtt gtttttctaa 1860 
aattcacagg gaagcttttg agcaggtctc aaacttaaga tgtcttttta agaaaaggag 1920 
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aaaaaagttg ttattgtctg tgcataagta agttgtaggt gactgagaga ctcagtcaga 1980 

cccttttaat gctggtcatg taataatatt gcaagtagta agaaacgaag gtgtcaagtg 2040 

tactgctggg cagcgaggtg atcattacca aaagtaatca actttgtggg tggagagttc 2100 
tttgtgagaa cttgcattat ttgtgtcctc ccctcatgtg taggtagaac atttcttaat 2160 

gctgtgtacc tgcctctgcc actgtatgtt ggcatctgtt atgctaaagt ttttcttgta 2220 

catgaaaccc tggaagacct actacaaaaa aactgttgtt tggcccccat agcaggtgaa 2280 

ctcattttgt gcttttaata gaaagacaaa tccaccccag taatattgcc cttacgtagt 2340 

tgtttaccat tattcaaagc tcaaaataga atttgaagcc ctctcacaaa atctgtgatt 2400 

aatttgctta attagagctt ctatccctca agcctaccta ccataaaacc agccatatta 2460 

ctgatactgt tcagtgcatt tagccaggag acttacgttt tgagtaagtg agatccaagc 2520 

agacgtgtta aaatcagcac tcctggactg gaaattaaag attgaaaggg tagactactt 2580 

ttcttttttt tactcaaaag tttagagaat ctctgtttct ttccatttta aaaacatatt 2640 

ttaagataat agcataaaga ctttaaaaat gttcctcccc tccatcttcc cacacccagt 2700 

caccagcact gtattttctg tcaccaagac aatgatttct tgttattgag gctgttgctt 2760 

ttgtggatgt gtgattttaa ttttcaataa acttttgcat cttggtttaa aagaaa 2816 

<210> 334 

<211> 2082 

<212> DNA 

<213> Homo sapiens 

<400> 334 

agatgctaca gcgactgcac acccaggctg tatgatacag cctattgctc ccgggctgca 60 
aacctgtcca gcatgtgatg tggtgggata ctgaattgaa taccgaatac tgtaggcaat 120 
tgtaacacag tggtaagtct ttgtgtatct aaacatagct aaacaccaaa aggtatagta 180 
agaatatggt attataatct tatggaacta tcattgtata tgtggtttgt caaccagaat 240 
gtagttatac agcacaggac tgtgcttatg atgtgccaag cacagctctc agtactaact 300 
cctttaatct tcatatcaac cctaggaggt aacttcttaa gtagattcat attgtaaggg 360 
tctcggggtg ggggggttgg caaaatcctg gagccagaag aaaggacagc agcattgatc 420 
aatcttacag ctaacatgtt gtacctggaa aacaatgccc agactcaatt tagtgagcca 480 
cagtacacga acctggggct cctgaacagc atggaccagc agattcagaa cggctcctcg 540 
tccaccagtc cctataacac agaccacgcg cagaacagcg tcacggcgcc ctcgccctac 600 
gcacagccca gctccacctt cgatgctctc tctccatcac ccgccatccc ctccaacacc 660 
gactacccag gcccgcacag tttcgacgtg tccttccagc agtcgagcac cgccaagtcg 720 
gccacctgga cgtattccac tgaactgaag aaactctact gccaaattgc aaagacatgc 780 
cccatccaga tcaaggtgat gaccccacct cctcagggag ctgttatccg cgccatgcct 840 
gtctacaaaa aagctgagca cgtcacggag gtggtgaagc ggtgccccaa ccatgagctg 900 
agccgtgaat tcaacgaggg acagattgcc cctcctagtc atttgattcg agtagagggg 960 
aacagccatg cccagtatgt agaagatccc atcacaggaa gacagagtgt gctggtacct 1020 
tatgagccac cccaggttgg cactgaattc acgacagtct tgtacaattt catgtgtaac 1080 
agcagttgtg ttggagggat gaaccgccgt ccaattttaa tcattgttac tctggaaacc 1140 
agagatgggc aagtcctggg ccgacgctgc tttgaggccc ggatctgtgc ttgcccagga 1200 
agagacagga aggcggatga agatagcatc agaaagcagc aagtttcgga cagtacaaag 1260 
aacggtgatg gtacgaagcg cccgtctcgt cagaacacac atggtatcca gatgacatcc 1320 
atcaagaaac gaagatcccc agatgatgaa ctgttatact taccagtgag gggccgtgag 1380 
acttatgaaa tgctgttgaa gatcaaagag tccctggaac tcatgcagta ccttcctcag 1440 
cacacaattg aaacgtacag gcaacagcaa cagcagcagc accagcactt acttcagaaa 1500 
cagtgagtgt atcaacgtgt cattttagga ggcatgagtg acggtgactt tatttggatc 1560 
agcaataggg tgattgatga gcaatgtgga acataatggg agatagcaga ttgtcataga 1620 
ttcagatgac ctggtatggc aaccctcttt cagttgcaac cttttttacg tgtcttatta 1680 
taaccttccc ttcagaattc cacttatgtt ctgaaattaa atacaaacca tttctggtga 1740 
attacaaaga aactcacact aacagttctc ttctctatat gcctggtcca tacacactaa 1800 
cagtaagtac acactctatt tggtagtgat gtgtatattt gaaaacatga aatcttttct 1860 
catcccaatg gattgtctta taaatctcct gggatgcaca ctatccactt ttgggaataa 1920 
cactgtagac cagggatagc aaataggctt tactataata taaagtgact tgtttgaatg 1980 
ctgtaatgag aagaattctg agacctagtg catgataatt ggggaaatat ctgggtgcag 2040 
aaggataagg tagcatcatg ttgccgtatt ttagcatctc tg 2082 
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<210> 335 

<211> 4849 

<212> DNA 

<213> Homo sapiens 

<400> 335 

cgttgatatc aaagacagtt gaaggaaatg 
cagtactgcc ctgaccctta catccagcgt 
aaagaaagtt attaccgatc caccatgtcc 
ccagaggttt tccagcatat ctgggatttt 
attgacttga actttgtgga tgaaccatca 
agcatggact gtatccgcat gcaggactcg 
acgaacctgg ggctcctgaa cagcatggac 
agtccctata acacagacca cgcgcagaac 
cccagctcca ccttcgatgc tctctctcca 
ccaggcccgc acagtttcga cgtgtccttc 
tggacgtatt ccactgaact gaagaaactc 
cagatcaagg tgatgacccc acctcctcag 
aaaaaagctg agcacgtcac ggaggtggtg 
gaattcaacg agggacagat tgcccctcct 
catgcccagt atgtagaaga tcccatcaca 
ccaccccagg ttggcactga attcacgaca 
tgtgttggag ggatgaaccg ccgtccaatt 
gggcaagtcc tgggccgacg ctgctttgag 
aggaaggcgg atgaagatag catcagaaag 
gatggtacga agcgcccgtt tcgtcagaac 
aaacgaagat ccccagatga tgaactgtta 
gaaatgctgt tgaagatcaa agagtccctg 
attgaaacgt acaggcaaca gcaacagcag 
tcaatacagt ctccatcttc atatggtaac 
atgaacaagc tgccttctgt gagccagctt 
cctacaacca ttcctgatgg catgggagcc 
atggctggag acatgaatgg actcagcccc 
ccatccacct cccagtgcac acccccacct 
ttcttagcga ggttgggctg ttcatcatgt 
accatctatc agattgagca ttactccatg 
caatttcgac atgcgatctg gaagggcatc 
tccccttctc atctcctgcg gaccccaagc 
gagacccggg gtgagcgtgt tattgatgct 
ttcccacccc gagatgagtg gaatgacttc 
caacagcgca tcaaagagga gggggagtga 
tcctaactgc cagcycccta aaagcactcc 
cctccccttc ctcttgtctg atttcttagg 
taacatctga cctggcatct aattctgatt 
cagaactgta gctgccatgg ctaggtagaa 
agctgcagag atttctcatt gacttttata 
tatatataaa tgtataaata tacagtatag 
taaaatgtaa tttaaatgaa agaaaattga 
ttgttttgga tggcttgtct atactccttc 
atctagagct taatgctaca tgtgagtgac 
attctaaata catgccacat caaacctttg 
ggtaagactg tagatatgta ttcttttctc 
tcttctagtg atgatggttc acgttggggt 
gtccaaacgt cctctttagt ttttggttgg 
cagccagttc aaaaacaccc gacgtcatgt 
taataccaga taccttatct tacaatattg 
gtattaaaac taaatttcac tactagattg 
taagaattct gattgatttg attgggatga 



aattttgaaa cttcacggtg tgccacccta 60 
ttcgtagaaa ccccagctca tttctcttgg 120 
cagagcacac agacaaatga attcctcagt 180 
ctggaacagc ctatatgttc agttcagccc 240 
gaagatggtg cgacaaacaa gattgagatt 300 
gacctgagtg accccatgtg gccacagtac 360 
cagcagattc agaacggctc ctcgtccacc 420 
agcgtcacgg cgccctcgcc ctacgcacag 480 
tcacccgcca tcccctccaa caccgactac 540 
cagcagtcga gcaccgccaa gtcggccacc 600 
tactgccaaa ttgcaaagac atgccccatc 660 
ggagctgtta tccgcgccat gcctgtctac 720 
aagcggtgcc ccaaccatga gctgagccgt 780 
agtcatttga ttcgagtaga ggggaacagc 840 
ggaagacaga gtgtgctggt accttatgag 900 
gtcttgtaca atttcatgtg taacagcagt 960 
ttaatcattg ttactctgga aaccagagat 1020 
gcccggatct gtgcttgccc aggaagagac 1080 
cagcaagttt cggacagtac aaagaacggt 1140 
acacatggta tccagatgac atccatcaag 1200 
tacttaccag tgaggggccg tgagacttat 1260 
gaactcatgc agtaccttcc tcagcacaca 1320 
cagcaccagc acttacttga gaaacagacc 1380 
agctccccac ctctgaacaa aatgaacagc 1440 
atcaaccctc agcagcgcaa cgccctcact 1500 
aacattccca tgatgggcac ccacatgcca 1560 
acccaggcac tccctccccc actctccatg 1620 
ccgtatccca cagattgcag cattgtcagt 1680 
ctggactatt tcacgaccca ggggctgacc 1740 
gatgatctgg caagtctgaa aatccctgag 1800 
ctggaccacc ggcagctcca cgaattctcc 1860 
agtgcctcta cagtcagtgt gggctccagt 1920 
gtgcgattca ccctccgcca gaccatctct 1980 
aactttgaca tggatgctcg ccgcaataag 2040 
gcctcaccat gtgagctctt cctatccctc 2100 
tgcttaatct tcaaagcctt ctccctagct 2160 
ggaaggagaa gtaagaggct acctcttacc 2220 
ctggctttaa gccttcaaaa ctatagcttg 2280 
gtgagcaaaa aagagttggg tgtctcctta 2340 
aagcatgttc acccttatag tctaagacta 2400 
atttttgggt ggggggcatt gagtattgtt 24 60 
gttgcactta ttgaccattt tttaatttac 2520 
ccttaagggg tatcatgtat ggtgataggt 2580 
gatgatgtac agattctttc agttctttgg 2640 
agtagatcca tttccattgc ttattatgta 2700 
agtgttggta tattttatat tactgacatt 2760 
gatttaatcc agttataaga agaagttcat 2820 
gaatgaggaa aattcttaaa aggcccatag 2880 
atttgagcat atcagtaacc cccttaaatt 2940 
attgggaaaa catttgctgc cattacagag 3000 
actaactcaa atacacattt gctactgttg 3060 
atgccatcta tctagttcta acagtgaagt 3120 
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tttactgtct attaatattc agggtaaata ggaatcattc agaaatgttg agtctgtact 3180 
aaacagtaag atatctcaat gaaccataaa ttcaactttg taaaaatctt ttgaagcata 3240 
gataatattg tttggtaaat gtttcttttg tttggtaaat gtttctttta aagaccctcc 3300 
tattctataa aactctgcat gtagaggctt gtttaccttt ctctctctaa ggtttacaat 3360 
aggagtggtg atttgaaaaa tataaaatta tgagattggt tttcctgtgg cataaattgc 3420 
atcactgtat cattttcttt tttaaccggt aagagtttca gtttgttgga aagtaactgt 3480 
gagaacccag tttcccgtcc atctccctta gggactaccc atagacatga aaggtcccca 3540 
cagagcaaga gataagtctt tcatggctgc tgttgcttaa accacttaaa cgaagagttc 3600 
ccttgaaact ttgggaaaac atgttaatga caatattcca gatctttcag aaatataaca 3660 
catttttttg catgcatgca aatgagctct gaaatcttcc catgcattct ggtcaagggc 3720 
tgtcattgca cataagcttc cattttaatt ttaaagtgca aaagggccag cgtggctcta 3780 
aaaggtaatg tgtggattgc ctctgaaaag tgtgtatata ttttgtgtga aattgcatac 3840 
tttgtatttt gattattttt tttttcttct tgggatagtg ggatttccag aaccacactt 3900 
gaaacctttt tttatcgttt ttgtattttc atgaaaatac catttagtaa gaataccaca 3960 
tcaaataaga aataatgcta caattttaag aggggaggga agggaaagtt tttttttatt 4020 
atttttttaa aattttgtat gttaaagaga atgagtcctt gatttcaaag ttttgttgta 4080 
cttaaatggt aataagcact gtaaacttct gcaacaagca tgcagctttg caaacccatt 4140 
aaggggaaga atgaaagctg ttccttggtc ctagtaagaa gacaaactgc ttcccttact 4200 
ttgctgaggg tttgaataaa cctaggactt ccgagctatg tcagtactat tcaggtaaca 4260 
ctagggcctt ggaaattcct gtactgtgtc tcatggattt ggcactagcc aaagcgaggc 4320 
acccttactg gcttacctcc tcatggcagc ctactctcct tgagtgtatg agtagccagg 4380 
gtaaggggta aaaggatagt aagcatagaa accactagaa agtgggctta atggagttct 4440 
tgtggcctca gctcaatgca gttagctgaa gaattgaaaa gtttttgttt ggagacgttt 4500 
ataaacagaa atggaaagca gagttttcat taaatccttt tacctttttt ttttcttggt 4560 
aatcccctaa aataacagta tgtgggatat tgaatgttaa agggatattt tttttctatt 4620 
atttttataa ttgtacaaaa ttaagcaaat gttaaaagtt ttatatgctt tattaatgtt 4680 
ttcaaaaggt attatacatg tgatacattt tttaagcttc agttgcttgt cttctggtac 4740 
tttctgttat gggcttttgg ggagccagaa gccaatctac aatctctttt tgtttgccag 4800 
gacatgcaat aaaatttaaa aaataaataa aaactaatta agaaataaa 4849 

<210> 336 

<211> 1386 

<212> DNA 

<213> Homo sapiens 

<400> 336 

atgttgtacc tggaaaacaa tgcccagact caatttagtg agccacagta cacgaacctg 60 
gggctcctga acagcatgga ccagcagatt cagaacggct cctcgtccac cagtccctat 120 
aacacagacc acgcgcagaa cagcgtcacg gcgccctcgc cctacgcaca gcccagctcc 180 
accttcgatg ctctctctcc atcacccgcc atcccctcca acaccgacta cccaggcccg 240 
cacagtttcg acgtgtcctt ccagcagtcg agcaccgcca agtcggccac ctggacgtat 300 
tccactgaac tgaagaaact ctactgccaa attgcaaaga catgccccat ccagatcaag 360 
gtgatgaccc cacctcctca gggagctgtt atccgcgcca tgcctgtcta caaaaaagct 420 
gagcacgtca cggaggtggt gaagcggtgc cccaaccatg agctgagccg tgaattcaac 480 
gagggacaga ttgcccctcc tagtcatttg attcgagtag aggggaacag ccatgcccag 540 
tatgtagaag atcccatcac aggaagacag agtgtgctgg taccttatga gccaccccag 600 
gttggcactg aattcacgac agtcttgtac aatttcatgt gtaacagcag ttgtgttgga 660 
gggatgaacc gccgtccaat tttaatcatt gttactctgg aaaccagaga tgggcaagtc 720 
ctgggccgac gctgctttga ggcccggatc tgtgcttgcc caggaagaga caggaaggcg 780 
gatgaagata gcatcagaaa gcagcaagtt tcggacagta caaagaacgg tgatggtacg 840 
aagcgcccgt ttcgtcagaa cacacatggt atccagatga catccatcaa gaaacgaaga 900 
tccccagatg atgaactgtt atacttacca gtgaggggcc gtgagactta tgaaatgctg 960 
ttgaagatca aagagtccct ggaactcatg cagtaccttc ctcagcacac aattgaaacg 1020 
tacaggcaac agcaacagca gcagcaccag cacttacttc agaaacagac ctcaatacag 1080 
tctccatctt catatggtaa cagctcccca cctctgaaca aaatgaacag catgaacaag 1140 
ctgccttctg tgagccagct tatcaaccct cagcagcgca acgccctcac tcctacaacc 1200 
attcctgatg gcatgggagc caacattccc atgatgggca cccacatgcc aatggctgga 1260 
gacatgaatg gactcagccc cacccaggca ctccctcccc cactctccat gccatccacc 1320 
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tcccactgca cacccccacc tccgtatccc acagattgca gcattgtcag gatctggcaa 1380 
gtctga 1386 

<210> 337 
<211> 1551 
<212> DNA 

<213> Homo sapiens 
<400> 337 

atgtcccaga gcacacagac aaatgaattc ctcagtccag aggttttcca gcatatctgg 60 

gattttctgg aacagcctat atgttcagtt cagcccattg acttgaactt tgtggatgaa 120 

ccatcagaag atggtgcgac aaacaagatt gagattagca tggactgtat ccgcatgcag 180 

gactcggacc tgagtgaccc catgtggcca cagtacacga acctggggct cctgaacagc 240 

atggaccagc agattcagaa cggctcctcg tccaccagtc cctataacac agaccacgcg 300 

cagaacagcg tcacggcgcc ctcgccctac gcacagccca gctccacctt cgatgctctc 360 

tctccatcac ccgccatccc ctccaacacc gactacccag gcccgcacag tttcgacgtg 420 

tccttccagc agtcgagcac cgccaagtcg gccacctgga cgtattccac tgaactgaag 480 

aaactctact gccaaattgc aaagacatgc cccatccaga tcaaggtgat gaccccacct 540 

cctcagggag ctgttatccg cgccatgcct gtctacaaaa aagctgagca cgtcacggag 600 

gtggtgaagc ggtgccccaa ccatgagctg agccgtgaat tcaacgaggg^ acagattgcc 660 

cctcctagtc atttgattcg agtagagggg aacagccatg cccagtatgt agaagatccc 720 

atcacaggaa gacagagtgt gctggtacct tatgagccac cccaggttgg cactgaattc 780 

acgacagtct tgtacaattt catgtgtaac agcagttgtg ttggagggat gaaccgccgt 840 

ccaattttaa tcattgttac tctggaaacc agagatgggc aagtcctggg ccgacgctgc 900 

tttgaggccc ggatctgtgc ttgcccagga agagacagga aggcggatga agatagcatc 960 

agaaagcagc aagtttcgga cagtacaaag aacggtgatg gtacgaagcg cccgtttcgt 1020 

cagaacacac atggtatcca gatgacatcc atcaagaaac gaagatcccc agatgatgaa 1080 

ctgttatact taccagtgag gggccgtgag acttatgaaa tgctgttgaa gatcaaagag 1140 

tccctggaac tcatgcagta ccttcctcag cacacaattg aaacgtacag gcaacagcaa 1200 

cagcagcagc accagcactt acttcagaaa cagacctcaa tacagtctcc atcttcatat 1260 

ggtaacagct ccccacctct gaacaaaatg aacagcatga acaagctgcc ttctgtgagc 1320 

cagcttatca accctcagca gcgcaacgcc ctcactccta caaccattcc tgatggcatg 1380 

,ggagccaaca ttcccatgat gggcacccac atgccaatgg ctggagacat gaatggactc 1440 

agccccaccc aggcactccc tcccccactc tccatgccat ccacctccca ctgcacaccc 1500 

ccacctccgt atcccacaga ttgcagcatt gtcaggatct ggcaagtctg a 1551 

<210> 338 . 
<211> 586 
<212> PRT 

<213> Homo sapiens 



.<400> 338 



Met Leu 


Tyr 


Leu 


Glu 


Asn 


Asn 


Ala 


Gin 


Thr 


Gin 


Phe 


Ser 


Glu 


Pro Gin 


1 






5 










10 








15 


Tyr Thr 


hsn 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser 


Met 


Asp 


Gin 


Gin 


lie 


Arg Asn 






20 










25 










30 




Gly Ser 


Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


Asn 


Thr 


Asp 


His 


Ala 


Gin 


Asn Ser 




35 










40 










45 






Val Thr 


Ala 


Pro 


Ser 


Pro 


Tyr 


Ala 


Gin 


Pro 


Ser 


Pro 


Thr 


Phe 


Asp Ala 


50 










55 










60 






Leu Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 


Asn 


Thr 


Asp 


Tyr 


Pro 


Gly Pro 


65 








70 










75 








80 


His Ser 


Ser 


Asp 


val 


Ser 


Phe 


Gin 


Gin 


Ser 


Ser 


Thr 


Ala 


Lys 


Ser Ala 








85 










90 








95 


Thr Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 


Lys 


Leu 


Tyr 


Cys 


Gin 


He Ala 






100 










105 








110 




Lys Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 


Met 


Thr 


Pro 


Pro 


Pro 


Gin Gly 




115 










120 










125 
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Ala Val He Arg Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr 

130 135 140 

Glu Val Val Lys Arg Cys Pro Asn His Glu Leu Ser Arg Glu Phe Asn 
145 150 155 160 

Glu Gly Gin He Ala Pro Pro Ser His Leu He Arg Val Glu Gly Asn 

165 170 175 

Ser His Ala Gin Tyr Val Glu Asp Pro He Thr Gly Arg Gin Ser Val 

180 185 190 

Leu Val Pro Tyr Glu Pro Pro Gin Val Gly Thr Glu Phe Thr Thr Val 

195 200 205 

Leu Tyr Asn Phe Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg 

210 215 220 

Arg Pro He Leu He He Val Thr Leu Glu Thr Arg Asp Gly Gin Val 
225 230 235 ' 240 

Leu Gly Arg Arg Cys Phe Glu Ala Arg He Cys Ala Cys Pro Gly Arg 

245 250 255 

Asp Arg Lys Ala Asp Glu Asp Ser He Arg Lys Gin Gin Val Ser Asp 

260 265 270 

Ser Thr Lys Asn Gly Asp Gly Thr Lys Arg Pro Phe Arg Gin Asn Thr 

275 280 285 

His Gly He Gin Met Thr Ser He Lys Lys Arg Arg Ser Pro Asp Asp 

290 295 300 

■ Glu Leu Leu Tyr Leu Pro Val Arg Gly Arg Glu Thr Tyr Glu Met Leu 
305 310 315 320 

Leu Lys He Lys Glu Ser Leu Glu Leu Met Gin Tyr Leu Pro Gin His 

325 330 335 

Thr He Glu Thr Tyr Arg Gin Gin Gin Gin Gin Gin His Gin His Leu 

340 345 350 

Leu Gin Lys Gin Thr Ser He Gin Ser Pro Ser Ser Tyr Gly Asn Ser 

355 360 365 

Ser Pro Pro Leu Asn Lys Met Asn Ser Met Asn Lys Leu Pro Ser Val 

370 375 380 

Ser Gin Leu He Asn Pro Gin Gin Arg Asn Ala Leu Thr Pro Thr Thr 
385 390 395 400 

He Pro Asp Gly Met Gly Ala Asn He Pro Met Met Gly Thr His Met 

405 410 415 

Pro Met Ala Gly Asp Met Asn Gly Leu Ser Pro Thr Gin Ala Leu Pro 

420 425 430 

Pro Pro Leu Ser Met Pro Ser Thr Ser His Cys Thr Pro Pro Pro Pro 

435 440 445 

Tyr Pro Thr Asp Cys Ser He Val Ser Phe Leu Ala Arg Leu Gly Cys 

450 455 460 

Ser Ser Cys Leu Asp Tyr Phe Thr Thr Gin Gly Leu Thr Thr He Tyr 
465 470 475 480 

Gin He Glu His Tyr Ser Met Asp Asp Leu Ala Ser Leu Lys He Pro 

485 490 495 

Glu Gin Phe Arg His Ala He Trp Lys Gly He Leu Asp His Arg Gin 

500 505 510 

Leu His Glu Phe Ser Ser Pro Ser His Leu Leu Arg Thr Pro Ser Ser 

515 520 525 

Ala Ser Thr Val Ser Val Gly Ser Ser Glu Thr Arg Gly Glu Arg Val 
530 535 

He Asp Ala Val Arg Phe Thr Leu Arg Gin Thr He Ser Phe Pro Pro 
545 550 555 560 

Arg Asp Glu Trp Asn Asp Phe Asn Phe Asp Met Asp Ala Arg Arg Asn 

565 570 575 

Lys Gin Gin Arg He Lys Glu Glu Gly Glu 
580 585 
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<210> 339 
<211> 641 
<212> PRT 

<213> Homo sapiens 



<400> 339 



Met Ser 


Gin 


Ser 


Thr 


Gin Thr Asn 


1 






5 




Gin His 


He 


Trp 


Asp 


Phe Leu Glu 






20 






lie Asp 


Leu 


Asn 


Phe 


Val Asp Glu 




35 






40 


Lys lie 


Glu 


He 


Ser 


Met Asp Cys 


50 








55 


Ser Asp 


Pro 


Met 


Trp 


Pro Gin Tyr 


65 








70 


Met Asp 


Gin 


Gin 


He 


Gin Asn Gly 








85 




Thr Asp 


His 


Ala 


Gin 


Asn Ser Val 






100 






Pro Ser 


Ser 


Thr 


Phe 


Asp Ala Leu 




115 






120 


Asn Thr 


Asp 


Tyr 


Pro 


Gly Pro His 


130 








135 


Ser Ser 


Thr 


Ala 


Lys 


Ser Ala Thr 


145 








150 


Lys Leu 


Tyr 


Cys 


Gin 


He Ala Lys 








165 




Met Thr 


Pro 


Pro 


Pro 


Gin Gly Ala 






180 




Lys Lys 


Ala 


Glu 


His 


Val Thr Glu 




195 






200 


Glu Leu 


Ser 


Arg 


Glu 


Phe Asn Glu 


210 








215 


Leu lie 


Arg 


Val 


Glu 


Gly Asn Ser 


225 








230 


lie Thr 


Gly 


Arg 


Gin 


Ser Val Leu 








245 




Gly Thr 


Glu 


Phe 


Thr 


Thr Val Leu 






260 






Cys Val 


Gly 


Gly 


Met 


Asn Arg Arg 




275 






280 


Glu Thr 


Arg 


Asp 


Gly 


Gin Val Leu 


290 








295 


lie Cys 


Ala 


Cys 


Pro 


Gly Arg Asp 


305 








310 


Arg Lys 


Gin 


Gin 


Val 


Ser Asp Ser 








325 




Arg Pro 


Phe 


Arg 


Gin 


Asn Thr His 






340 






Lys Arg 


Arg 


Ser 


Pro 


Asp Asp Glu 




355 






360 


Arg Glu 


Thr 


Tyr 


Glu 


Met Leu Leu 


370 








375 


Met Gin 


Tyr 


Leu 


Pro 


Gin His Thr 


385 








390 



Glu Phe 


Leu 


Ser 


Pro 


Glu 


Val 


Phe 


10 










15 




Gin Pro 


He 


Cys 


Ser 


Val 


Gin 


Pro 


25 








30 






Pro Ser 


Glu 


Asp 


Gly 


Ala 


Thr. 


Asn 








45 








He Arg 


Met 


Gin 


Asp 


Ser 


Asp 


Leu 






60 










Thr Asn 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser 




75 










80 


Ser Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


Asn 


90 










95 




Thr Ala 


Pro 


Ser 


Pro 


Tyr 


Ala 


Gin 


105 








110 






Ser Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 








125 








Ser Phe 


Asp 


Val 


Ser 


Phe 


Gin 


Gin 






140 










Trp Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 




155 










160 


Thr Cys 


Pro 


He 


Gin 


He 


Lys 


Val 


170 










175 




Val He 


Arg 


Ala 


Met 


Pro 


Val 


Tyr 


185 








190 






Val Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 








205 








Gly Gin 


He 


Ala 


Pro 


Pro 


Ser 


His 






220 










His Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 




235 










240 


Val Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 


250 










255 




Tyr Asn 


Phe 


Met 


Cys 


Asn 


Ser 


Ser 


265 








270 






Pro He 


Leu 


He 


He 


Val 


Thr 


Leu 








285 








Gly Arg 


Arg 


Cys 


Phe 


Glu 


Ala 


Arg 






300 










Arg Lys 


Ala 


Asp 


Glu 


Asp 


Ser 


He 




315 










320 


Thr Lys 


Asn 


Gly 


Asp 


Gly 


Thr 


Lys 


330 










335 




Gly He 


Gin 


Met 


Thr 


Ser 


He 


Lys 


345 








350 






Leu Leu 


Tyr 


Leu 


Pro 


Val 


Arg 


Gly 








365 








Lys He 


Lys 


Glu 


Ser 


Leu 


Glu 


Leu 



380 



He Glu Thr Tyr Arg Gin Gin Gin 
395 400 
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Gin Gin Gin His Gin His Leu Leu Gin Lys Gin Thr Ser lie Gin Ser 

405 410 415 

Pro Ser Ser Tyr Gly Asn Ser Ser Pro Pro Leu Asn Lys Met Asn Ser 

420 425 430 

Met Asn Lys Leu Pro Ser Val Ser Gin Leu lie Asn Pro Gin Gin Arg 

435 440 445 

Asn Ala Leu Thr Pro Thr Thr lie Pro Asp Gly Met Gly Ala Asn lie 

450 455 460 

Pro Met Met Gly Thr His Met Pro Met Ala Gly Asp Met Asn Gly Leu 
465 470 475 480 

Ser Pro Thr Gin Ala Leu Pro Pro Pro Leu Ser Met Pro Ser Thr Ser 

485 490 495 

His Cys Thr Pro Pro Pro Pro Tyr Pro Thr Asp Cys Ser lie Val Gly 

500 505 510 

Phe Leu Ala Arg Leu Gly Cys Ser Ser Cys Leu Asp Tyr. Phe Thr Thr 

515 520 525 

Gin Gly Leu Thr Thr lie Tyr Gin lie Glu His Tyr Ser Met Asp Asp 

530 535 540 

Leu Ala Ser Leu Lys lie Pro Glu Gin Phe Arg His Ala He Trp Lys 
545 550 555 560 

Gly He Leu Asp His Arg Gin Leu His Glu Phe Ser Ser Pro Ser His 

565 570 575 

Leu Leu Arg Thr Pro Ser Ser Ala Ser Thr Val Ser Val Gly Ser Ser 

580 585 590 

Glu Thr Arg Gly Glu Arg Val He Asp Ala Val Arg Phe Thr Leu Arg 

595 600 605 

Gin Thr He Ser Phe Pro Pro Arg Asp Glu Trp Asn Asp Phe Asn Phe 

610 615 620 

Asp Met Asp Ala Arg Arg Asn Lys Gin Gin Arg He Lys Glu Glu Gly 
625 630 635 * 640 

Glu 



<210> 340 
<211> 448 
<212> PRT 
<213> Homo sapiens 

<400> 340 



Met 


Ser 


Gin 


Ser 


Thr Gin Thr Asn 


1 








5 


Gin 


His 


He 


Trp 


Asp Phe Leu Glu 








20 




He 


Asp 


Leu 


Asn 


Phe Val Asp Glu 






35 




40 


Lys 


He 


Glu 


He 


Ser Met Asp Cys 




50 






55 


Ser 


Asp 


Pro 


Met 


Trp Pro Gin Tyr 


65 








70 


Met 


Asp 


Gin 


Gin 


He Gin Asn Gly 










85 


Thr 


Asp 


His 


Ala 


Gin Asn Ser Val 








100 




Pro 


Ser 


Ser 


Thr 


Phe Asp Ala Leu 






115 




120 


Asn 


Thr 


Asp 


Tyr 


Pro Gly Pro His 




130 






135 



Glu Phe Leu Ser Pro Glu Val Phe 

10 15 
Gin Pro He Cys Ser Val Gin Pro 
25 30 
Pro Ser Glu Asp Gly Ala Thr Asn 
45 

He Arg Met Gin Asp Ser Asp Leu 
60 

Thr Asn Leu Gly Leu Leu Asn Ser 

75 80 
Ser Ser Ser Thr Ser Pro Tyr Asn 

90 95 
Thr Ala Pro Ser Pro Tyr Ala Gin 
105 110 
Ser Pro S.er Pro Ala He Pro Ser 
125 

Ser Phe Asp Val Ser Phe Gin Gin. 
140 
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Ser 


Ser 


Thr 


Ala 


145 








I*ys 


Leu 




Cys 


Met 


Thr 


Pro 


Pro 








180 


Lys 


Lys 


Ala 


Glu 






195 




Glu 


Leu 


Ser 


Arg 




210 






Leu 


He 


Arg 


Val 


225 








lie 


Thr 


Glv 


Arg 


Gly 


Thr 


Glu 


Phe 








260 


Cys 


Val 


Gly 


Gly 






275 




Glu 


Thr 


Arg 


Asp 




290 






He 


Cys 


Ala 


Cys 


305 








Arg 


uys 




lain 


Arg 


Pro 


Phe 


Arg 








340 


Lys 


Arg 


Arg 


Ser 






355 




Arg 


Glu 


Thr 


Tyr 




370 






Met 


Gin 


Tyr 


Leu 


385 








Gin 


Gin 


Gin 


His 


Phe 


Arg 


Asn 


Glu 








420 


Asp 


Val 


Phe 


Phe 



435 



Lys Ser Ala 

150 
Gin He Ala 
165 

Pro Gin Gly 

His Val Thr 

Glu Phe Asn 
215 

Glu Gly Asn 

230 
Gin Ser Val 
245 

Thr Thr Val 

Met Asn Arg 

Gly Gin Val 
295 

Pro Gly Arg 

310 
Val Ser Asp 
325 

Gin Asn Thr 

Pro Asp Asp 

Glu Met Leu 
375 

Pro Gin His 

390 
Gin His Leu 
405 

Leu Val Glu 
Arg His Ser 



Thr Trp 

Lys Thr 

Ala Val 
185 
Glu Val 
200 

Glu Gly 

Ser His 

Leu Val 

Leu Tyr 
265 
Arg Pro 

280 

Leu Gly 

Asp Arg 

Ser Thr 

His Gly 
345 
Glu Leu 
360 

Leu Lys 
Thr He 

Leu Gin 

Pro Arg 
425 
Lys Pro 
440 



Thr Tyr 
155 
Cys Pro 
170 

He Arg 

Val Lys 

Gin He 

Ala Gin 
235 
Pro Tyr 
250 

Asn Phe 
He Leu 

Arg Arg 

Lys Ala 
315 
Lys Asn 
330 

He Gin 
Leu Tyr 
He Lys 

Glu Thr 

395 
Lys His 
410 

Arg Glu 
Pro Asn 



Ser Thr 

He Gin 

Ala Met 

Arg Cys 
205 
Ala Pro 
220 

Tyr Val 
Glu Pro 
Met Cys 

He He 

285 
Cys Phe 
300 

Asp Glu 

Gly Asp 

Met Thr 

Leu Pro 
365 
Glu Ser 
380 

Tyr Arg 

Leu Leu 

Thr Pro 

Arg Ser 
445 



Glu Leu Lys 
160 

He Lys Val 

175 
Pro Val Tyr 
190 

Pro Asn His 

Pro Ser His 

Glu Asp Pro 
240 

Pro Gin Val 

255 
Asn Ser Ser 
270 

Val Thr Leu 

Glu Ala Arg 

Asp Ser He 
320 

Gly Thr Lys 

335 
Ser He Lys 

350 

Val Arg Gly 

Leu Glu Leu 

Gin Gin Gin 
400 

Ser Ala Cys 

415 
Lys Gin Ser 
430 

Val Tyr Pro 



<210> 341 












<211> 356 












<212> PRT 












<213> Homo sapiens 








<400> 341 












Met Leu Tyr 


Leu 


Glu 


Asn 


Asn 


Ala 


1 




5 








Tyr Thr Asn 


Leu 


Gly Leu 


Leu 


Asn 




20 










Gly Ser Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


35 










40 


Val Thr Ala 


Pro 


Ser 


Pro Tyr Ala 


50 








55 




Leu Ser Pro 


Ser 


Pro 


Ala 


He 


Pro 


65 






70 






His Ser Phe 


Asp 


Val 


Ser 


Phe 


Gin 



85 



Gin Thr Gin Phe Ser Glu Pro Gin 

10 15 
Ser Met Asp Gin Gin He Gin Asn 
25 30 
Asn Thr Asp His Ala Gin Asn Ser 
45 

Gin Pro Ser Ser Thr Phe Asp Ala 
60 

Ser Asn Thr Asp Tyr Pro Gly Pro 

75 80 
Gin Ser Ser Thr Ala Lys Ser Ala 
90 95 
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Thr Trp Thr Tyr Ser Thr Glu Leu Lys Lys 

100 105 
Lys Thr Cys Pro lie Gin lie Lys Val Met 

115 120 
Ala Val lie Arg Ala Met Pro Val Tyr Lys 

130 135 
Glu Val Val Lys Arg Cys Pro Asn His Glu 
145 150 
Glu Gly Gin lie Ala Pro Pro Ser His Leu 
165 170 
Ser His Ala Gin Tyr Val Glu Asp Pro He 

180 185 
Leu Val Pro Tyr Glu Pro Pro Gin Val Gly 

195 200 
Leu Tyr Asn Phe Met Cys Asn Ser Ser Cys 

210 215 
Arg Pro He Leu He He Val Thr Leu Glu 
225 230 

Leu Gly Arg Arg Cys Phe Glu Ala Arg He 
245 250 
Asp Arg Lys Ala Asp Glu Asp Ser He Arg 

260 265 
Ser Thr Lys Asn Gly Asp Gly Thr Lys Arg 

275 280 
His Gly He Gin Met Thr Ser He Lys Lys 

290 295 
Glu Leu Leu Tyr Leu Pro Val Arg Gly Arg 
305 310 
Leu Lys He Lys Glu Ser Leu Glu Leu Met 
325 330 
Thr He Glu Thr Tyr Arg Gin Gin Gin Gin 

340 345 
Leu Gin Lys Gin 
355 



Leu Tyr Cys Gin He Ala 
110 

Thr Pro Pro Pro Gin Gly 
125 

Lys Ala Glu His Val Thr 
140 

Leu Ser Arg Glu Phe Asn 
155 160 
He Arg Val Glu Gly Asn 
175 

Thr Gly Arg Gin Ser Val 
190 

Thr Glu Phe Thr Thr Val 
205 

Val Gly Gly Met Asn Arg 

220 

Thr Arg Asp Gly Gin Val 
235 240 
Cys Ala Cys Pro Gly Arg 
255 

Lys Gin Gin Val Ser Asp 
270 

Pro Ser Arg Gin Asn Thr 
285 

Ser Pro Asp Asp 



Arg Arg 
300 
Glu Thr 
315 

Gin Tyr 



Tyr Glu Met Leu 
320 

Leu Pro Gin His 

335 

Gin Gin His Gin His Leu 
350 



<210> 342 
<211> 680 
<212> PRT 

<213> Homo sapiens 



<400> 342 



Met 


Asn 


Phe 


Glu 


Thr 


Ser Arg Cys 


1 








5 




Pro 


Tyr 


He 


Gin 


Arg 


Phe Val Glu 








20 






Glu 


Ser 


Tyr 


Tyr 


Arg 


Ser Thr Met 






35 






40 


Phe 


Leu 


Ser 


Pro 


Glu 


Val Phe Gin 




50 








55 


Pro 


He 


Cys 


Ser 


Val 


Gin Pro He 


65 










70 


Ser 


Glu 


Asp 


Gly 


Ala 


Thr Asn Lys 










85 




Arg 


Met 


Gin 


Asp 


Ser Asp Leu Ser 








100 






Asn 


Leu 


Gly 


Leu 


Leu 


Asn Ser Met 






115 






120 



Ala Thr Leu Gin Tyr Cys Pro Asp 

10 15 
Thr Pro Ala His Phe Ser Trp Lys 

25 30 
Ser Gin Ser Thr Gin Thr Asn Glu 
45 

His He Trp Asp Phe Leu Glu Gin 
60 

Asp Leu Asn Phe Val Asp Glu Pro 

75 80 
He Glu He Ser Met Asp Cys He 

90 95 
Asp Pro Met Trp Pro Gin Tyr Thr 
105 110 
Asp Gin Gin He Gin Asn Gly Ser 
125 
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Ser Ser Thr Ser Pro Tyr Asn Thr Asp His Ala Gin Asn Ser Val Thr 

130 135 140 

Ala. Pro Ser Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Ala Leu Ser 
145 150 155 160 

Pro Ser Pro Ala lie Pro Ser Asn Thr Asp Tyr Pro Gly Pro His Ser 

165 170 175 

Phe Asp Val Ser Phe Gin Gin Ser Ser Thr Ala Lys Ser Ala Thr Trp 

180 185 190 

Thr Tyr Ser Thr Glu Leu Lys Lys Leu Tyr Cys Gin lie Ala Lys Thr 

195 200 205 

Cys Pro lie Gin lie Lys Val Met Thr Pro Pro Pro Gin Gly Ala Val 

210 215 220 

He Arg Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Glu Val 
225 230 235 240 

Val Lys Arg Cys Pro Asn His Glu Leu Ser Arg Glu Phe Asn Glu Gly 

245 250 255 

Gin He Ala Pro Pro Ser His Leu He Arg Val Glu Gly Asn Ser His 

260 265 270 

Ala Gin Tyr Val Glu Asp Pro lie Thr Gly Arg Gin Ser Val Leu Val 

275 280 285 

Pro Tyr Glu Pro Pro Gin Val Gly Thr Glu Phe Thr Thr Val Leu Tyr 

290 295 300 

Asn Phe Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg Pro 
305 310 315 320 

He Leu He lie Val Thr Leu Glu Thr Arg Asp Gly Gin Val Leu Gly 

325 330 335 

Arg Arg Cys Phe Glu Ala Arg He Cys Ala Cys Pro Gly Arg Asp Arg 

340 345 350 

Lys Ala Asp Glu Asp Ser He Arg Lys Gin Gin Val Ser Asp Ser Thr 

355 360 365 

Lys Asn Gly Asp Gly Thr Lys Arg Pro Phe Arg Gin Asn Thr His Gly 

370 375 380 

He Gin Met Thr Ser He Lys Lys Arg Arg Ser Pro Asp Asp Glu Leu 
385 390 395 . 400 

Leu Tyr Leu Pro Val Arg Gly Arg Glu Thr Tyr Glu Met Leu Leu Lys 

405 410 415 

He Lys Glu Ser Leu Glu Leu Met Gin Tyr Leu Pro Gin His Thr He 

420 425 430 

Glu Thr Tyr Arg Gin Gin Gin Gin Gin Gin His Gin His Leu Leu Gin 

435 440 445 

Lys Gin Thr Ser He Gin Ser Pro Ser Ser Tyr Gly Asn Ser Ser Pro 

450 455 460 

Pro Leu Asn Lys Met Asn Ser Met Asn Lys Leu Pro Ser Val Ser Gin 
465 470 475 480 

Leu He Asn Pro Gin Gin Arg Asn Ala Leu Thr Pro Thr Thr He Pro 

485 490 495 

Asp Gly Met Gly Ala Asn He Pro Met Met Gly Thr His Met Pro Met 

500 505 510 

Ala Gly Asp Met Asn Gly Leu Ser Pro Thr Gin Ala Leu Pro Pro Pro 

515 520 525 

Leu Ser Met Pro Ser Thr Ser Gin Cys Thr Pro Pro Pro Pro Tyr Pro 

530 535 540 

Thr Asp Cys Ser He Val Ser Phe Leu Ala Arg Leu Gly Cys Ser Ser 
545 550 555 560 

Cys Leu Asp Tyr Phe Thr Thr Gin Gly Leu Thr Thr He Tyr Gin He 

565 570 575 

Glu His Tyr Ser Met Asp Asp Leu Ala Ser Leu Lys He Pro Glu Gin 
580 585 590 
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Phe Arg 


His 


Ala He Trp Lys 


Gly 




595 






600 


Glu Phe 


Ser 


Ser 


Pro Ser His 


Leu 


610 






615 




Thr Val 


Ser 


Val 


Gly Ser Ser 


Glu 


625 






630 




Ala Val 


Arg 


Phe 


Thr Leu Arg 


Gin 








645 




Glu Trp 


Asn 


Asp 


Phe Asn Phe 


Asp 






660 






Gin Arg 


He 


Lys 


Glu Glu Gly 


Glu 




675 






680 



He Leu Asp His Arg Gin Leu His 
605 

Leu Arg Thr Pro Ser Ser Ala Ser 
620 

Thr Arg Gly Glu Arg Val He Asp 
635 640 
Thr He Ser Phe Pro Pro Arg Asp 

650 655 
Met Asp Ala Arg Arg Asn Lys Gin 
665 670 



<210> 34 3 
<211> 461 
<212> PRT 
<213> Homo sapiens 



<400> 343 



Met 


Leu 


Tyr 


Leu 


Glu 


Asn 


Asn 


Ala 


Gin Thr Gin Phe Ser 


Glu Pro Gin 


1 








5 








10 


15 


Tyr 


Thr 


Asn 


Leu 


Gly Leu Leu 


Asn Ser Met Asp Gin Gin 


He Gin Asn 








20 










25 


30 


Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


Pro 


Tyr Asn Thr Asp His Ala 


Gin Asn Ser 






35 


- 








40 


45 




Val 


Thr 


Ala 


Pro 


Ser 


Pro Tyr 


Ala 


Gin Pro Ser Ser Thr 


Phe Asp Ala 




50 










55 




60 




Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser Asn Thr Asp Tyr 


Pro Gly Pro 


65 










70 






75 


80 


His 


Ser 


Phe 


Asp 


Val 


Ser 


Phe 


Gin 


Gin Ser Ser Thr Ala 


Lys Ser Ala 










85 








90 


95 


Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys Lys Leu Tyr Cys 


Gin He Ala 








100 










105 


110 


Lys 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val Met Thr Pro Pro 


Pro Gin Gly 






115 










120 


125 




Ala 


Val 


He 


Arg 


Ala 


Met 


Pro 


Val 


Tyr Lys Lys Ala Glu 


His Val Thr 




130 










135 




140 




Glu 


Val 


Val 


Lys 


Arg Cys 


Pro 


Asn 


His Glu Leu Ser Arg 


Glu Phe Asn 


145 










150 






155 


160 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Pro 


Ser His Leu He Arg Val 


Glu Gly Asn 










165 








170 


175 


Ser 


His 


Ala 


Gin 


Tyr Val Glu 


Asp 


Pro He Thr Gly Arg 


Gin Ser Val 








180 










185 


190 


Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val Gly Thr Glu Phe 


Thr Thr Val 






195 










200 


205 




Leu 


Tyr 


Asn 


Phe 


Met 


Cys Asn 


Ser 


Ser Cys Val Gly Gly 


Met Asn Arg 




210 










215 




220 




Arg 


Pro 


He 


Leu 


He 


He 


Val 


Thr 


Leu Glu Thr Arg Asp 


Gly Gin Val 


225 










230 






235 


240 


Leu 


Gly 


Arg 


Arg 


Cys 


Phe 


Glu 


Ala Arg He Cys Ala Cys 


Pro Gly Arg 










245 








250 


255 


Asp 


Arg 


Lys 


Ala 


Asp Glu Asp 


Ser He Arg Lys Gin Gin 


Val Ser Asp 








260 










265 


270 


Ser 


Thr 


Lys 


Asn 


Gly Asp Gly 


Thr 


Lys Arg Pro Phe Arg 


Gin Asn Thr 






275 










280 


285 




His 


Gly 


He 


Gin 


Met 


Thr 


Ser 


He Lys Lys Arg Arg Ser 


Pro Asp Asp 




290 










295 




300 





wo 02/00174 



151 



PCT/USOl/21065 



Glu 


Leu 


Leu Tyr Leu 


Pro 


Val 


Arg 


Gly 


Arg 


Glu 


Thr 


Tyr 


Glu 


Met 


Leu 


305 






310 










315 








320 


Leu 


Lys 


lie Lys Glu 


Ser 


Leu 


Glu 


Leu 


Met 


Gin 


Tyr 


Leu 


Pro 


Gin 


His 






325 










330 










335 




Thr 


lie 


Glu Thr Tyr 


Arg 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


His 


Gin 


His 


Leu 






340 








345 










350 






Leu 


Gin 


Lys Gin Thr 


Ser 


He 


Gin 


Ser 


Pro 


Ser 


Ser 


Tyr 


Gly 


Asn 


Ser 






355 






360 










365 






Ser 


Pro 


Pro Leu Asn 


Lys 


Met 


Asn 


Ser 


Met 


Asn 


Lys 


Leu 


Pro 


Ser 


Val 




370 






375 










380 










Ser 


Gin 


Leu lie Asn 


Pro 


Gin 


Gin 


Arg 


Asn 


Ala 


Leu 


Thr 


Pro 


Thr 


Thr 


385 






390 










395 










400 


lie 


Pro 


Asp Gly Met 


Gly Ala 


Asn 


He 


Pro 


Met 


Met 


Gly 


Thr 


His 


Met 






405 










410 








415 




Pro 


Met 


Ala Gly Asp 


Met 


Asn 


Gly 


Leu 


Ser 


Pro 


Thr 


Gin 


Ala 


Leu 


Pro 






420 








425 










430 






Pro 


Pro 


Leu Ser Met 


Pro 


Ser 


Thr 


Ser 


His 


Cys 


Thr 


Pro 


Pro 


Pro 


Pro 






435 






440 










445 








Tyr 


Pro 


Thr Asp Cys 


Ser 


He 


Val 


Arg 


He 


Trp 


Gin 


Val 









450 455 460 



<210> 344 
<211> 516 

<212> PRT 

<213> Homo sapiens 



<400> 344 



Met 


Ser 


Gin 


Ser 


Thr 


Gin Thr 


Asn 


Glu 


Phe 


Leu 


Ser 


Pro 


Glu 


Val 


Phe 


1 








5 








10 










15 




Gin 


His 


He 


Trp 


Asp 


Phe Leu 


Glu 


Gin 


Pro 


He 


Cys 


Ser 


Val 


Gin 


Pro 








20 








25 










30 






He 


Asp 


Leu 


Asn 


Phe 


Val Asp 


Glu 


Pro 


Ser 


Glu 


Asp 


Gly 


Ala 


Thr 


Asn 






35 








40 










45 








Lys 


He 


Glu 


He 


Ser 


Met Asp 


Cys 


He 


Arg 


Met 


Gin 


Asp 


Ser 


Asp 


Leu 




50 








55 










60 










Ser 


Asp 


Pro 


Met 


Trp 


Pro Gin 


Tyr 


Thr 


Asn 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser 


65 










70 








75 










80 


Met 


Asp 


Gin 


Gin 


He 


Gin Asn 


Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


Asn 










85 








90 










95 




Thr 


Asp 


His 


Ala 


Gin 


Asn Ser 


Val 


Thr 


Ala 


Pro 


Ser 


Pro 


Tyr 


Ala 


Gin 








100 








105 










110 






Pro 


Ser 


Ser 


Thr 


Phe 


Asp Ala 


Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 






115 








120 










125 








Asn 


Thr 


Asp 


Tyr 


Pro 


Gly Pro 


His 


Ser 


Phe 


Asp 


Val 


Ser 


Phe 


Gin 


Gin 




130 








i35 










-140 










Ser 


Ser 


Thr 


Ala 


Lys 


Ser Ala 


Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 


145 










150 








155 










160 


Lys 


Leu 


Tyr 


Cys 


Gin 


He Ala 


Lys 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 










165 








170 










175 




Met 


Thr 


Pro 


Pro 


Pro 


Gin Gly 


Ala 


Val 


He 


Arg 


Ala 


Met 


Pro 


Val 


Tyr 








180 








185 










190 




Lys 


Lys 


Ala 


Glu 


His 


Val Thr 


Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 






195 








200 










205 








Glu 


Leu 


Ser 


Arg 


Glu 


Phe Asn 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Pro 


Ser 


His 




210 








215 










220 










Leu 


He 


Arg 


Val 


Glu 


Gly Asn 


Ser 


flis 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


225 










230 








235 








240 
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lie Thr 


Gly Arg Gin 


Ser 


Val 


Leu Val Pro Tyr Glu 


Pro Pro Gin Val 




245 






250 


255 


Gly Thr 


Glu Phe Thr 


Thr Val Leu Tyr Asn Phe Met 


Cys Asn Ser Ser 


260 






265 


270 


Cys Val 


Gly Gly Met 


Asn Arg Arg Pro lie Leu lie 


He Val Thr Leu 


275 






280 


285 


Glu Thr 


Arg Asp Gly 


Gin 


Val 


Leu Gly Arg Arg Cys. 


Phe Glu Ala Arg 


290 






295 


300 




lie Cys 


Ala Cys Pro 


Gly Arg Asp Arg Lys Ala Asp 


Glu Asp Ser He 


305 




310 




315 


320 


Arg Lys 


Gin Gin Val 


Ser Asp Ser Thr Lys Asn Gly 


Asp Gly Thr Lys 


325 






330 


335 


Arg Pro 


Phe Arg Gin 


Asn Thr His Gly lie Gin Met 


Thr Ser He Lys 


340 






345 


350 


Lys Arg 


Arg Ser Pro 


Asp Asp 


Glu Leu Leu Tyr Leu 


Pro Val Arg Gly 


355 






360 


365 


Arg Glu 


Thr Tyr Glu 


Met 


Leu Leu Lys lie Lys Glu 


Ser Leu Glu Leu 


370 






375 


380 




Met Gin 


Tyr Leu Pro 


Gin 


His 


Thr lie Glu Thr Tyr 


Arg Gin Gin Gin 


385 


390 




395 


400 


Gin Gin 


Gin His Gin 


His 


Leu 


Leu Gin Lys Gin Thr 


Ser He Gin Ser 




405 






410 


415 


Pro Ser 


Ser Tyr Gly 


Asn 


Ser 


Ser Pro Pro Leu Asn 


Lys Met Asn Ser 




420 






425 


430 


Met Asn 


Lys Leu -Pro 


Ser 


Val 


Ser Gin Leu He Asn 


Pro Gin Gin Arg 




435 






440 


445 


Asn Ala 


Leu Thr Pro 


Thr 


Thr 


He Pro Asp Gly Met 


Gly Ala Asn He 


450 






455 


4 60 




Pro Met 


Met Gly Thr 


His 


Met 


Pro Met Ala Gly Asp 


Met Asn Gly Leu 


465 


470 




475 


480 


Ser Pro 


Thr Gin Ala 


Leu 


Pro 


Pro Pro Leu Ser Met 


Pro Ser Thr Ser 




485 






490 


495 


His Cys 


Thr Pro Pro 


Pro 


Pro 


Tyr Pro Thr Asp Cys 


Ser He Val Arg 


500 






505 


510 


lie Trp 


Gin Val 












515 











<210> 345 

<211> 1800 

<212> DNA 

<213> Homo sapiens 

<400> 345 

gcgcctcatt gccactgcag tgactaaagc 
actggttgtt ttttaaacaa attctgatac 
tgacattcgt atcatcactg tgcaccattg 
ggaggtctga aaccctcgca gagggatctt 
agtcgttgga aacaggactc agggataaac 
tttcatcggg ggtgtcaaca aacactccac 
ctttattttc cgagtcatga tcctagtggt 
agaggacttc gtctgcaaca cactgcaacc 
tttcccggtg tcccacatcc ggctgtgggc 
gctgctggtg gccatgcatg tggcctacta 
aggagagaag aggaatgatt tcaaagacat 
agaggggtcg ctgtggtgga cgtacaccag 
agcctttatg tatgtgtttt acttccttta 
atgtgggatt gacccctgcc ccaaccttgt 



tgggaagacg ctggtcagtt cacctgcccc 60 

aggcgacatc ctcactgacc gagcaaagat 120 
gcttctaggc actccagtgg ggtaggagaa 180 
gccctcattc tttgggtctg aaacactggc 240 
cagcgcaatg gattggggga cgctgcacac 300 
cagcatcggg aaggtgtgga tcacagtcat 360 
ggctgcccag gaagtgtggg gtgacgagca 420 
gggatgcaaa aatgtgtgct atgaccactt 480 
cctccagctg atcttcgtct ccaccccagc 540 
caggcacgaa accactcgca agttcaggcg 600 
agaggacatt aaaaagcaca aggttcggat 660 
cagcatcttt ttccgaatca tctttgaagc 720 
caatgggtac cacctgccct gggtgttgaa 780 
tgactgcttt atttctaggc caacagagaa 840 
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gaccgtgttt accattttta tgatttctgc gtctgtgatt tgcatgctgc ttaacgtggc 900 
agagttgtgc tacctgctgc tgaaagtgtg ttttaggaga tcaaagagag cacagacgca 960 
aaaaaatcac cccaatcatg ccctaaagga gagtaagcag aatgaaatga atgagctgat 1020 
ttcagatagt ggtcaaaatg caatcacagg tttcccaagc taaacatttc aaggtaaaat 1080 
gtagctgcgt cataaggaga cttctgtctt ctccagaagg caataccaac ctgaaagttc 1140 
cttctgtagc ctgaagagtt tgtaaatgac tttcataata aatagacact tgagttaact 1200 
ttttgtagga tacttgctcc attcatacac aacgtaatca aatatgtggt ccatctctga 1260 
aaacaagaga ctgcttgaca aaggagcatt gcagtcactt tgacaggttc cttttaagtg 1320 
gactctctga caaagtgggt actttctgaa aatttatata actgttgttg ataaggaaca 1380 
tttatccagg aattgatacg tttattagga aaagatattt ttataggctt ggatgttttt 1440 
agttccgact ttgaatttat ataaagtatt tttataatga ctggtcttcc ttacctggaa 1500 
aaacatgcga tgttagtttt agaattacac cacaagtatc taaatttcca acttacaaag 1560 
ggtcctatct tgtaaatatt gttttgcatt gtctgttggc aaatttgtga actgtcatga 1620 
tacgcttaag gtgggaaagt gttcattgca caatatattt ttactgcttt ctgaatgtag 1680 
acggaacagt gtggaagcag aaggcttttt taactcatcc gtttggccga tcgttgcaga 1740 
ccactgggag atgtggatgt ggttgcctcc ttttgctcgt ccccgtggct taacccttct 1800 



<210> 346 
<211> 261 
<212> PRT 

<213> Homo sapiens 
<400> 346 



Met Asp 


Trp 


Gly 


Thr 


Leu 


His 


Thr 


Phe 


He 


Gly Gly 


Val 


Asn Lys 


His 


1 






5 










10 






15 




Ser Thr 


Ser 


He 


Gly 


Lys 


Val 


Trp 


He 


Thr 


Val He 


Phe 


He Phe 


Arg 






20 










25 








30 


Val Met 


He 


Leu 


Val 


Val 


Ala 


Ala 


Gin 


Glu 


Val Trp 


Gly 


Asp Glu 


Gin 




35 










40 








45 






Glu Asp 


Phe 


Val 


Cys 


Asn 


Thr 


Leu 


Gin 


Pro 


Gly Cys 


Lys 


Asn Val 


Cys 


50 








55 








60 








Tyr Asp 


His 


Phe 


Phe 


Pro 


Val 


Ser 


His 


He 


Arg Leu 


Trp 


Ala Leu 


Gin 


65 








70 










75 




80 


Leu lie 


Phe 


Val 


Ser 


Thr 


Pro 


Ala 


Leu 


Leu 


Val Ala 


Met 


His Val 


Ala 








85 










90 






95 




Tyr Tyr 


Arg 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 


Arg Arg 


Gly 


Glu Lys 


Arg 






100 










105 








110 




Asn Asp 


Phe 


Lys 


Asp 


He 


Glu 


Asp 


He 


Lys 


Lys His 


Lys 


Val Arg 


He 




115 










120 








125 






Glu Gly 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 


Ser He 


Phe 


Phe Arg 


He 


130 










135 








140 








lie Phe 


Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 


Tyr Phe 


Leu 


Tyr Asn 


Gly 


145 








150 










155 






160 


Tyr His 


Leu 


Pro 


Trp 


Val 


Leu 


Lys 


Cys 


Gly 


He Asp 


Pro 


Cys Pro 


Asn 








165 










170 






175 




Leu Val 


Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 


Glu Lys 


Thr 


Val Phe 


Thr 






180 










185 








190 




He Phe 


Met 


He 


Ser 


Ala 


Ser 


Val 


He 


Cys 


Met Leu 


Leu 


Asn Val 


Ala 




195 










200 








205 






Glu Leu 


Cys 


Tyr 


Leu 


Leu 


Leu 


Lys 


Val 


Cys 


Phe Arg 


Arg 


Ser Lys 


Arg 


210 










215 








220 


Ala Gin 


Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 


Ala Leu 


Lys 


Glu Ser 


Lys 


225 








230 










235 




240 


Gin Asn 


Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 


Ser Gly 


Gin 


Asn Ala 


He 








245 










250 






255 




Thr Gly 


Phe 


Pro 


Ser 





















260 
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<210> 347 

<211> 1740 

<212> DNA 

<213> Homo sapiens 



<400> 347 

atgaacaaac 

atcttcaagg 

ttcgtggact 

atagaactgc 

cggaaacttc 

ctagtccagt 

gttgtaaatg 

ggatttcagt 

cagcaaaacc 

aggcaggggt 

ctggttccca 

atcaccaaac 

gagaagtcga 

ctggagatta 

attttagctc 

aaaattgagc 

tataatccag 

gaggagatca 

caagcacatt 

gggatgccac 

gagcaatcag 

atcggcaagc 

attgctccag 

gaggctcagt 

agtcctaaag 

agagttattg 

gttgttgtcc 

ggtcacttct 

aagcagcacc 



tgtatatcgg 
acgccaagat 
gcccggacga 
acgggaaacc 
agatacgaaa 
atggagtggt 
taacctattc 
tagagaattt 
ccttgcagca 
ctccaggatc 
cccaatttgt 
agacccagtc 
ttactatcct 
tgcataagga 
ataataactt 
aagacacaga 
aacgcactat 
tgaagaaaat 
taattcctgg 
ctcccacctc 
aaacggagac 
agggccagca 
cggaagcacc 
tcaaggctca 
aagaggtgaa 
gaaaaggagg 
ctcgtgacca 
atgcttgcca 
aacaacagaa 



aaacctcagc 
cccggtgtcg 
gagctgggcc 
catagaagtt 
tatcccgcct 
ggagagctgt 
cagtaaggac 
caccttgaaa 
gccccgaggt 
cgtatccaag 
tggagccatc 
taaaatcgat 
ctctactcct 
agctcaagat 
tgttggacgt 
cactaaaatc 
tacagttaaa 
cagggagtct 
attaaatctg 
agggccccct 
tgttcatctg 
catcaagcag 
agatgctaaa 
gggaagaatt 
acttgaagct 
caaaacggtg 
gacacctgat 
ggttgcccag 
ggctctgcaa 



gagaacgccg 
ggacccttcc 
ctcaaggcca 
gage act egg 
catttacagt 
gagcaagtga 
caagctagae 
gtagcctata 
cgccgggggc 
cagaaaccat 
ataggaaaag 
gtccaccgta 
gaaggcacet 
ataaaattca 
cttattggta 
acgatatctc 
ggcaatgttg 
tatgaaaatg 
aacgccttgg 
teagccatga 
tttateecag 
ctttctcgct 
gtgaggatgg 
tatggaaaaa 
catateagag 
aatgaacttc 
gagaatgaee 
agaaaaattc 
agtggaecac 



ccccctcgga 

tggtgaagac 

tcgaggcgct 

teccaaaaag 

gggaggtget 

aeactgaetc 

aagcactaga 

tccctgatga 

ttgggcagag 

gtgatttgcc 

aaggtgccae 

aagaaaatgc 

ctgeggcttg 

cagaagagat 

aagaaggaag 

cattgcagga 

agacatgtgc 

atattgcttc 

gtctgttccc 

ctcctcceta 

ctetatcagt 

ttgctggage 

tgattatcac 

ttaaagaaga 

tgccatcctt 

agaatttgtc 

aagtggttgt 

aggaaattct 

cteagtcaag 



cctagaaagt 

tggctacgcg 

ttcaggt.aaa 

gcaaaggatt 

ggatagttta 

ggaaactgca 

caaactgaat 

aacggccgcc 

gggctcctca 

tctgcgcctg 

cattcggaac 

gggggctgct 

taagtctatt 

ccccttgaag 

aaatcttaaa 

attgaegctg 

caaagetgag 

tatgaatett 

acccacttca 

cccgcagttt 

cggtgccatc 

ttcaattaag 

tggaccacca 

aaactttgtt 

tgctgctggc 

aagtgcagaa 

eaaaataaet 

gactcaggta 

acggaagtaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 



<210> 348 

<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Met Asn Lys Leu Tyr lie Gly Asn Leu Ser 

15 10 
Asp Leu Glu Ser lie Phe Lys Asp Ala Lys 

20 25 
Phe Leu Val Lys Thr Gly Tyr Ala Phe Val 

35 40 
Trp Ala Leu Lys Ala lie Glu Ala Leu Ser 

50 55 
Gly Lys Pro lie Glu Val Glu His Ser Val 
65 70 

Arg Lys Leu Gin lie Arg Asn lie Pro Pro 

85 90 
Leu Asp Ser Leu Leu Val Gin Tyr Gly Val 



Glu Asn Ala Ala 
lie Pro 
Asp Cys 



Gly Lys 

60 
Pro Lys 
75 

His Leu 
Val Glu 



Val Ser 

30 
Pro Asp 
45 

lie Glu 



Arg Gin 
Gin Trp 
Ser Cys 



Pro Ser 
15 

Gly Pro 

Glu Ser 

Leu His 

Arg He 

80 
Glu Val 
95 

Glu Gin 
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100 


105 








110 






Val Asn 


Thr 


Asp 


Ser Glu Thr Ala Val Val 


Asn 


Val 


Thr 


Tyr 


Ser 


Ser 




115 




120 






125 






Lys Asp 


Gin 


Ala 


Arg Gin Ala Leu Asp Lys 


Leu 


Asn 


Gly 


Phe 


Gin 


Leu 


130 






135 




140 








Glu Asn 


Phe 


Thr 


Leu Lys Val Ala Tyr He 


Pro 


Asp 


Glu 


Thr 


Ala 


Ala 


145 






150 


155 








160 


Gin Gin 


Asn 


Pro 


Leu Gin Gin Pro Arg Gly 


Arg 


Arg 


Gly 


Leu 


Gly 


Gin 








165 170 










175 




Arg Gly 


Ser 


Ser 


Arg Gin Gly Ser Pro Gly 


Ser 


Val 


Ser 


Lys 


Gin 


Lys 






180 


185 








190 




Pro Cys 


Asp 


Leu 


Pro Leu Arg Leu Leu Val 


Pro 


Thr 


Gin 


Phe 


Val 


Gly 




195 




200 






205 






Ala lie 


He 


Gly 


Lys Glu Gly Ala Thr He 


JVrg 


Asn 


He 


Thr 


Lys 


Gin 


210 






215 




220 








Thr Gin 


Ser 


Lys 


He Asp Val His Arg Lys 


Glu 


Asn 


Ala 


Gly 


Ala 


Ala 


225 






230 


235 








240 


Glu Lys 


Ser 


He 


Thr He Leu Ser Thr Pro 


Glu 


Gly 


Thr 


Ser 


Ala 


Ala 








245 250 








255 




Cys Lys 


Ser 


He 


Leu Glu He Met His Lys 


Glu 


Ala 


Gin 


Asp 


He 


Lys 






260 


265 








270 




Phe Thr 


Glu 


Glu 


He Pro Leu Lys He Leu 


Ala 


His 


Asn 


Asn 


Phe 


Val 




275 




280 






285 








Gly Arg 


Leu 


He 


Gly Lys Glu Gly Arg Asn 


Leu 


Lys 


Lys 


He 


Glu 


Gin 


290 






295 




300 








Asp Thr 


Asp 


Thr' 


Lys He Thr He Ser Pro 


Leu 


Gin 


Glu 


Leu 


Thr 


Leu 


305 






310 


315 










320 


Tyr Asn 


Pro 


Glu 


Arg Thr He Thr Val Lys 


Gly 


Asn 


Val 


Glu 


Thr 


Cys 








325 330 










335 


Ala Lys 


Ala 


Glu 


Glu Glu He Met Lys Lys 


He 


Arg 


Glu 


Ser 


Tyr 


Glu 






340 


345 








350 




Asn Asp 


He 


Ala 


Ser Met Asn Leu Gin Ala 


His 


Leu 


He 


Pro 


Gly 


Leu 




355 




360 






365 






Asn Leu 


Asn 


Ala 


Leu Gly Leu Phe Pro Pro 


Thr 


Ser 


Gly 


Met 


Pro 


Pro 


370 






375 




380 








Pro Thr 


Ser 


Gly 


Pro Pro Ser Ala Met Thr 


Pro 


Pro 


Tyr 


Pro 


Gin 


Phe 


385 






390 


395 








400 


Glu Gin 


Ser 


Glu 


Thr Glu Thr Val His Leu 


Phe 


He 


Pro 


Ala 


Leu 


Ser 








405 410 










415 




Val Gly 


Ala 


He 


He Gly Lys Gin Gly Gin 


His 


He 


Lys 


Gin 


Leu 


Ser 






420 


425 








430 






Arg Phe 


Ala 


Gly 


Ala Ser He Lys He Ala 


Pro 


Ala 


Glu 


Ala 


Pro 


Asp 




435 




440 






445 






Ala Lys 


Val 


Arg 


Met Val He He Thr Gly 


Pro 


Pro 


Glu 


Ala 


Gin 


Phe 


450 






455 




460 










Lys Ala 


Gin 


Gly 


Arg He Tyr Gly Lys He 


Lys 


Glu 


Glu 


Asn 


Phe 


Val 


465 






470 


475 










480 


Ser Pro 


Lys 


Glu 


Glu Val Lys Leu Glu Ala 


His 


He 


Arg 


Val 


Pro 


Ser 








485 490 








495 




Phe Ala 


Ala 


Gly 


Arg Val He Gly Lys Gly 


Gly 


Lys 


Thr 


Val 


Asn 


Glu 






500 


505 








510 






Leu Gin 


Asn 


Leu 


Ser Ser Ala Glu Val Val 


Val 


Pro 


Arg 


Asp 


Gin 


Thr 




515 




520 






525 






Pro Asp 


Glu 


Asn 


Asp Gin Val Val Val Lys 


He 


Thr 


Gly 


His 


Phe 


Tyr 


530 






535 




540 








Ala Cys 


Gin 


Val 


Ala Gin Arg Lys He Gin 


Glu 


He 


Leu 


Thr 


Gin 


Val 


545 






550 


555 










560 


Lys Gin 


His 


Gin 


Gin Gin Lys Ala Leu Gin 


Ser 


Gly 


Pro 


Pro 


Gin 


Ser 
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565 570 575 

Arg Arg Lys 



<210> 349 
<211> 207 
<212> DNA 
<213> Homo sapiens 

<400> 349 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 

gctgcagcag cctccaccca gcctgaggat gacatcaata cacagaggaa gaagagtcag 120 

gaaaagatga gagaagttac agactctcct gggcgacccc gagagcttac cattcctcag 180 
acttcttcac atggtgctaa cagattt 207 

<210> 350 
<211> 69 
<212> PRT 

<213> Homo sapiens 
<400> 350 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

15 10 15 

Ser Ser Gin lie Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp lie 

20 25 30 

Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp 

35 40 45 

Ser Pro Gly Arg Pro Arg Glu Leu Thr lie Pro Gin Thr Ser Ser His 

50 55 60 

Gly Ala Asn Arg Phe 
65 



<210> 351 
<211> 1012 

<212> DNA 

<213> Homo sapiens 
<400> 351 

ccctctagaa ataattttgt ttaactttaa gaaggagata tacatatgca tcaccatcac 60 

catcacacgg ccgcgtccga taacttccag ctgtcccagg gtgggcaggg attcgccatt 120 

ccgatcgggc aggcgatggc gatcgcgggc cagatcaagc ttcccaccgt tcatatcggg 180 

cctaccgcct tcctcggctt gggtgttgtc gacaacaacg gcaacggcgc acgagtccaa 240 

cgcgtggtcg ggagcgctcc ggcggcaagt ctcggcatct ccaccggcga cgtgatcacc 300 

gcggtcgacg gcgctccgat caactcggcc accgcgatgg cggacgcgct taacgggcat 360 

catcccggtg acgtcatctc ggtgacctgg caaaccaagt cgggcggcac gcgtacaggg 420 

aacgtgacat tggccgaggg acccccggcc gaattcatgg attgggggac gctgcacact 4 80 

ttcatcgggg gtgtcaacaa acactccacc agcatcggga ^aggtgtggat cacagtcatc 540 

tttattttcc gagtcatgat cctcgtggtg gctgcccagg aagtgtgggg tgacgagcaa 600 

gaggacttcg tctgcaacac actgcaaccg ggatgcaaaa atgtgtgcta tgaccacttt 660 

ttcccggtgt cccacatccg gctgtgggcc ctccagctga tcttcgtctc caccccagcg 720 

ctgctggtgg ccatgcatgt ggcctact'ac aggcacgaaa ccactcgcaa gttcaggcga 780 

ggagagaaga ggaatgattt caaagacata gaggacatta aaaagcagaa ggttcggata 840 

gaggggtgac tcgagcacca ccaccaccac cactgagatc cggctgctaa caaagcccga 900 

aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc 960 

tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg at 1012 
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<210> 352 
<211> 267 
<212> PRT 
<213> Homo sapiens 

<400> 352 

Met His His His His His His Thr Ala Ala Ser Asp Asn Phe Gin Leu 

15 10 15 

Ser Gin Gly Gly Gin Gly Phe Ala lie Pro lie Gly Gin Ala Met Ala 

20 25 30 

He Ala Gly Gin He Lys Leu Pro Thr Val His He Gly Pro Thr Ala 

35 40 45 

Phe Leu Gly Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val 

50 55 60 

Gin Arg Val Val Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr 
65 70 75 80' 

Gly Asp Val He Thr Ala Val Asp Gly Ala Pro He Asn Ser Ala Thr 

85 90 95 

Ala Met Ala Asp Ala Leu Asn Gly His His Pro Gly J\sp Val He Ser 

100 105 110 

Val Thr Trp Gin Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr • 

115 120 125 

Leu Ala Glu Gly Pro Pro Ala Glu Phe Met Asp Trp Gly Thr Leu His 

130 135 140 

Thr Phe He Gly Gly Val Asn Lys His Ser Thr Ser He Gly Lys Val 
145 150 155 160 

Trp He Thr Val He Phe He Phe Arg Val Met He Leu Val Val Ala 

165 170 175 

Ala Gin Glu Val Trp Gly Asp Glu Gin Glu Asp Phe Val Cys Asn Thr 

180 185 190 

Leu Gin Pro Gly Cys Lys Asn Val Cys Tyr Asp His Phe Phe Pro Val 

195 200 205 

Ser His He Arg Leu Trp Ala Leu Gin Leu He Phe Val Ser Thr Pro 

210 215 220 

Ala Leu Leu Val Ala Met His Val Ala Tyr Tyr Arg His Glu Thr Thr 
225 230 235 240 

Arg Lys Phe Arg Arg Gly Glu Lys Arg Asn Asp Phe Lys Asp He Glu 

245 250 255 

Asp He Lys Lys Gin Lys Val Arg He Glu Gly 
260 265 



<210> 353 
<211> 900 
<212> DNA 
<213> Homo sapiens 

<400> 353 

atgcatcacc atcaccatca cacggccgcg 
cagggattcg ccattccgat cgggcaggcg 
accgttcata tcgggcctac cgccttcctc 
ggcgcacgag tccaacgcgt ggtcgggagc 
ggcgacgtga tcaccgcggt cgacggcgct 
gcgcttaacg ggcatcatcc cggtgacgtc 
ggcacgcgta cagggaacgt gacattggcc 
actcgcaagt tcaggcgagg agagaagagg 
aagcagaagg ttcggataga ggggtcgctg 
cgaatcatct ttgaagcagc ctttatgtat 



tccgataact tccagctgtc ccagggtggg 60 
atggcgatcg cgggccagat caagcttccc 120 
ggcttgggtg ttgtcgacaa caacggcaac 180 
gctccggcgg caagtctcgg catctccacc 240 
ccgatcaact cggccaccgc gatggcggac 300 
atctcggtga cctggcaaac caagtcgggc 360 
gagggacccc cggccgaatt ccacgaaacc 420 
aatgatttca aagacataga ggacattaaa 480 
tggtggacgt acaccagcag catctttttc 540 
gtgttttact tcctttacaa tgggtaccac 600 
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ctgccctggg tgttgaaatg tgggattgac 
tctaggccaa cagagaagac cgtgtttacc 
atgctgctta acgtggcaga gttgtgctac 
aagagagcac agacgcaaaa aaatcacccc 
gaaatgaatg agctgatttc agatagtggt 



ccctgcccca accttgttga ctgctttatt 660 

atttttatga tttctgcgtc tgtgatttgc 720 

ctgctgctga aagtgtgttt taggagatca 780 

aatcatgccc taaaggagag taagcagaat 840 

caaaatgcaa tcacaggttt cccaagctaa 900 



<210> 354 
<211> 299 
<212> PRT 
<213> Homo sapiens 



<400> 354 



Met 


His 


His His His His 


His 


Thr 


1 




5 






Ser 


Gin 


Gly Gly Gin Gly 


Phe 


Ala 






20 






He 


Ala 


Gly Gin He Lys 


Leu 


Pro 






35 




40 


Phe 


Leu 


Gly Leu Gly Val 


Val 


.Asp 




50 




55 




Gin 


Arg 


Val Val Gly Ser 


Ala 


Pro 


65 




70 






Gly 


Asp 


Val He Thr Ala 


Val 


Asp 






85 






Ala 


Met 


Ala Asp Ala Leu 


Asn 


Gly 






100 






Val 


Thr 


Trp Gin Thr Lys 


Ser 


Gly 






115 




120 


Leu 


Ala 


Glu Gly Pro Pro 


Ala 


Glu 




130 




135 




Arg 


Arg 


Gly Glu Lys Arg 


Asn 


Asp 


145 




150 






Lys 


Gin 


Lys Val Arg He 


Glu 


Gly 






165 






Ser 


He 


Phe Phe Arg He 


He 


Phe 






180 






Tyr 


Phe 


Leu Tyr Asn Gly 


Tyr 


His 






195 




200 


lie 


Asp 


Pro Cys Pro Asn 


Leu 


Val 




210 




215 




Glu 


Lys 


Thr Val Phe Thr 


He 


Phe 


225 




230 






Met 


Leu 


Leu Asn Val Ala 


Glu 


Leu 






245 






Phe 


Arg 


Arg Ser Lys Arg 


Ala 


Gin 






260 






Ala 


Leu 


Lys Glu Ser Lys 


Gin 


Asn 






275 




280 


Ser 


Gly 


Gin Asn Ala He 


Thr 


Gly 




290 




295 





Ala 


Ala 


Ser 


Asp 


Asn 


Phe 


Gin 


Leu 




10 










15 




He 


Pro 


He 


Gly 


Gin 


Ala 


Met 


Ala 


25 










30 






Thr 


Val 


His 


He 


Gly 


Pro 


Thr 


Ala 










45 








Asn 


Asn 


Gly 


Asn 


Gly 


Ala 


Arg 


Val 








60 










Ala 


Ala 


Ser 


Leu 


Gly 


He 


Ser 


Thr 






75 










80 


Gly 


Ala 


Pro 


He 


Asn 


Ser 


Ala 


Thr 




90 










95 




His 


His 


Pro 


Gly 


Asp 


Val 


He 


Ser 


105 










110 






Gly 


Thr 


Arg 


Thr 


Gly 


Asn 


Val 


Thr 










125 








Phe 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 








140 










Phe 


Lys* 


Asp 


He 


Glu 


Asp 


He 


Lys 






155 










160 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 




170 










175 




Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 


185 










190 






Leu 


Pro 


Trp 


Vai 


Leu 


Lys 


Cys 


Gly 










205 








Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 








220 










Met 


He 


Ser 


Ala 


Ser 


Val 


He 


Cys 






235 










240 


Cys 


Tyr 


Leu 


Leu 


Leu 


Lys 


Val 


Cys 




250 










255 




Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 


265 










270 






Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 



285 



Phe Pro Ser 



<210> 355 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PGR primer 

<400> 355 

ggagtacagc ttcaagacaa tggg 

<210> 356 
<211> 31 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 356 

ccatgggaat tcattataat aattttgttc c 

<210> 357 
<211> 920 
<212> PRT 

<213> Homo sapiens 



<400> 357 



Met 


Gin 


His 


His 


His 


His 


His 


His 


Gly 


1 








5 










Tyr 


Asn 


Gly 


Leu 


Leu 


He 


Ala 


He 


Asn 








20 










25 


Asn 


Leu 


He 


Ser 


Asn 


He 


Lys 


Glu 


Met 






35 










40 




Leu 


Phe 


Asn 


Ala 


Thr 


Lys 


Arg 


Arg 


Val 




50 










55 






Leu 


lie 


Pro 


Ala 


Thr 


Trp 


Lys 


Ala 


Asn 


65 










70 








Glu 


Ser 


Tyr 


Glu 


Lys 


Ala 


Asn 


Val 


He 










85 










His 


Gly 


Asp 


Asp 


Pro 


Tyr 


Thr 


Leu 


Gin 








100 










105 


Gly 


Lys 


Tyr 


He 


His 


Phe 


Thr 


Pro 


Asn 






115 










120 




Thr 


Ala 


Gly 


Tyr 


Gly 


Ser 


Arg 


Gly 


Arg 




130 










135 






His 


Leu 


Arg 


Trp 


Gly 


Val 


Phe 


Asp 


Glu 


145 










150 








Tyr 


He 


Asn 


Gly 


Gin 


Asn 


Gin 


He 


Lys 










165 










He 


Thr 


Gly 


He 


Phe 


Val 


Cys 


Glu 


Lys 








180 










185 


Cys 


He 


He 


Ser 


Lys 


Leu 


Phe 


Lys 


Glu 






195 










200 




Ser 


Thr 


Gin 


Asn 


Ala 


Thr 


Ala 


Ser 


He 




210 










215 






Ser 


Val 


Val 


Glu 


Phe 


Cys 


Asn 


Ala 


Ser 


225 










230 








Asn 


Leu 


Gin 


Asn 


Gin 


Met 


Cys 


Ser 


Leu 










245 










Thr 


Asp 


Ser 


Ala 


Asp 


Phe 


His 


His 


Ser 








260 










265 


Leu 


Pro 


Pro 


Pro 


Pro 


Thr 


Phe 


Ser 


Leu 



PCT/USOl/21065 

159 



24 



31 



Val 


Gin 


Leu 


Gin 


Asp 


Asn Gly 


10 










15 


Pro 


Gin 


Val 


Pro 


Glu 


Asn Gin 










30 




He 


Thr 


Glu 


Ala 


Ser 


Phe Tyr 








45 






Phe 


Phe 


Arg 


Asn 


He 


Lys He 






60 








Asn 


Asn 


Ser 


Lys 


He 


Lys Gin 




75 








80 


Val 


Thr 


Asp 


Trp 


Tyr 


Gly Ala 


90 










95 


Tyr 


Arg 


Gly 


Cys 


Gly 


Lys Glu 










110 




Phe 


Leu 


Leu 


Asn 


Asp 


Asn Leu 








125 






Val 


Phe 


Val 


His 


Glu 


Trp Ala 






140 








Tyr 


Asn 


Asn 


Asp 


Lys 


Pro Phe 




155 








160 


Val 


.Thr 


Arg 


Cys 


Ser 


Ser Asp 


170 










175 


Gly 


Pro 


Cys 


Pro 


Gin 


Glu Asn 










190 




Gly 


Cys 


Thr 


Phe 


He 


Tyr Asn 








205 






Met 


Phe 


Met 


Gin 


Ser 


Leu Ser 






220 








Thr 


His 


Asn 


Gin 


Glu 


Ala Pro 




235 








240 


Arg 


Ser 


Ala 


Trp 


Asp 


Val He 


250 










255 


Phe 


Pro 


Met 


Asn 


Gly 


Thr Glu 



270 



Val Glu Ala Gly Asp Lys Val 
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275 




Val Cys 


Leu 


Val 


290 






Leu Leu 


Gin 


Leu 


305 






Glu lie 


His 


Thr 


lie Arg 


Ala 


Gin 






340 


Leu Val 


Ser 


Tyr 




355 




lie Cys 


Ser 


Gly 


370 






Gly Lys 


Ala 


Tyr 


385 






Lys Leu 


Leu 


Gly 


lie His 


Ser* 


He 






420 


Leu Ser 


Ara 


Leu 




435 




Asn Ser 


Asn 


Ser 


450 






Gly Asp 


He 


Phe 


465 






Val Lys 


Pro 


His 


Val Gly 


Asn 


Asp 






500 


Pro Glu 


He 


He 




515 




Asn Phe 


He 


Thr 


530 






Gly Thr 


Ala 


Lvs 


545 






His Ser 


Leu 


Gin 


Ser Ala 


Val 


Pro 






580 


Leu His 


Phe 


Pro 




595 




phe Tyr 


Pro 


He 


610 






Thr Gly 


Asp 


Pro 


625 






Asp Val 


He 


Lys 


Ala Ala 


Asn 


Gly 






660 


Ser He 


Ser 


Thr 




675 




Val Pro 


Gly 


Tyr 


690 






Lys Ser 


Val 


Gly 


705 






Val Ser 


Ser 


Gly 


His Pro 


Asp 


Val 





280 


Leu Asp Val 


Ser 


295 




Gin Gin Ala 


Ala 


310 




Phe Val Gly 


He 


325 




Leu His Gin 


He 


Leu Pro Thr 


Thr 




360 


Leu Lys Lys. 


Gly 


375 




Gly Ser Val 


Met 


390 




Asn Cys Leu 


Pro 


405 




Ala Leu Gly 


Ser 


Thr Gly Gly 


Leu 




440 


Met He Asp 


Ala 


455 




Gin Gin His 


He 


470 




His Gin Leu 


Lvs 


485 




Thr Met Phe 


Leu 


Leu Phe Asp 


Pro 




520 


Asn Leu Thr 


Phe 


535 




Pro Gly His 


Trp 


550 




Ala Leu Lys 


Val 


565 




Pro Ala Thr 


Val 


His Pro Val 


Met 




600 


Leu Asn Ala 


Thr 


615 




Val Thr Leu 


Ara 


630 




Asn Asn Glv 


He 


645 




Arg Tyr Ser 


Leu 


Pro Ala His 


Ser 




680 


Thr Ala Asn 


Gly 


695 




Arg Asn Glu 


Glu 


710 




Gly Ser Phe 


Ser 


725 




Phe Pro Pro 


Cys 



Ser 


Lys 


Met 


Ala 








300 


Glu 


Phe 


Tyr 


Leu 






315 




Ala 


Ser 


Phe 


Asp 




330 






Asn 


Ser 


Asn 


Asp 


345 








Val 


Ser 


Ala 


Lys 


Phe 


Glu 


Val 


Val 








380 


He 


Leu. 


Val 


Thr 






395 




Thr 


Val 


Leu 


Ser 




410 






Ser 


Ala 


Ala 


Pro 


425 








Lys 


Phe 


Phe 


Val 


Phe 


Ser 


Arg 


He 








460 


Gin 


Leu 


Glu 


Ser 






475 




Asn 


Thr 


Val 


Thr 




490 






Val 


Thr 


Trp 


Gin 


505 








Asp 


Glv 


Arg 


Lys 


Arcr 


Thr 


Ala 


Ser 








540 


Thr 


Tyr 


Thr 


Leu 






555 




Thr 


Val 


Thr 


Ser 




570 






Glu 


Ala 


Phe 


Val 


585 








He 


Tvr 


Ala 


Asn 


Val 


Thr 


Ala 


Thr 








620 


Leu 


Leu 


Asp 


Asp 






635 




Tyr 


Ser 


Arg 


Tyr 




650 






Lys 


Val 


His 


Val 


665 








He 


Pro 


Gly 


Ser 


Asn 


He 


Gin 


Met 








700 


Glu 


Arg 


Lys 


Trp 






715 




Val 


Leu 


Gly 


Val 




730 






Lys 


He 


He 


Asp 



285 



Glu 


Ala 


Asp 


Arg 


Met 


Gin 


He 


Val 








320 


Ser 


Lvs 


Gly 


Glu 






335 




Asp 


Arg 


Lys 


Leu 




350 






Thr 


Asp 


He 


Ser 


365 








Glu 


Lys 


Leu 


Asn 


Ser 


Gly 


Asp 


Asp 








400 


Ser 


Gly 


Ser 


Thr 






415 




Asn 


Leu 


Glu 


Glu 




430 






Pro 


Asp 


He 


Ser 


445 








Ser 


Ser 


Glv 


Thr 


Thr 


Glv 


Glu 


Asn 








480 


Val 


Asp 


Asn 


Thr 






495 




Ala 


Ser 


Gly 


Pro 




510 






Tyr 


Tyr 


Thr 


Asn 


525 








Leu 


Trp 


He 


Pro 


Asn 


Asn 


Thr 


His 








560 


Ara 


Ala 


Ser 


Asn 






575 




Glu 


Ara 


Asp 


Ser 




590 






Val 


Lys 


Gin 


Gly 


605 








Val 


Glu 


Pro 


Glu 


Gly 


Ala 


Glv 


Ala 








640 


Phe 


Phe 


Ser 


Phe 






655 




Asn 


His 


Ser 


Pro 




670 






His 


Ala 


Met 


Tyr 


685 








Asn 


Ala 


Pro 


Arg 


Gly 


Phe 


Ser 


Arg 








720 


Pro 


Ala 


Gly 


Pro 






735 




Leu 


Glu 


Ala 


Val 
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740 




Lys 


Val 


Glu 


Glu 


Glu Leu Thr Leu 






755 




760 


Phe 


Asp 


Gin 


Gly 


Gin Ala Thr Ser 




770 






775 


Leu 


Gin 


Asn 


He 


Gin Asp Asp Phe 


785 








790 


Ser 


Lys 


Arg 


Asn 


Pro Gin Gin Ala 










805 


Ser 


Pro 


Gin 


He 


Ser Thr Asn Gly 








820 




Thr 


His 


Glu 


Ser 


His Arg He Tyr 






835 




840 


Asn 


Ser 


Leu 


Gin 


Ser Ala Val Ser 




850 






855 


lie 


Pro 


Pro 


Asn 


Ser Asp Pro Val 


865 








870 


Lys 


Gly 


Val 


Leu 


Thr Ala Met Gly 










885 


He 


Val 


Val 


Thr 


His His Thr Leu 








900 




Lys 


Glu 


Asn 


Gly 


Thr Lys Leu Leu 






915 




920 



745 






750 






Ser Trp Thr 


Ala 


Pro Gly 


Glu 


Asp 






765 








Tyr Glu He 


Arg Met 


Ser 


Lys 


Ser 




780 








Asn Asn Ala 


He 


Leu 


Val 


Asn 


Thr 


795 










800 


Gly He Arg 


Glu 


He 


Phe 


Thr 


Phe 


810 








815 




Pro Glu His 


Gin 


Pro 


Asn 


Gly 


Glu 


825 






830 




Val Ala He 


Arg Ala 


Met 


Asp 


Arg 






845 








Asn He Ala 


Gin 
860 


Ala 


Pro 


Leu 


Phe 


Pro Ala Arg 


Asp 


Tyr 


Leu 


He 


Leu 


875 










880 


Leu He Gly 


He 


He 


Cys 


Leu 


He 


890 








895 




Ser Arg Lys 


Lys Arg Ala 


Asp 


Lys 


905 






910 







<210> 358 
<211> 2773 
<212> DNA 

<213> Homo sapiens 
<400> 358 

catatgcagc atcaccacca tcaccacgga 
ttgctcattg caattaatcc tcaggtacct 
gaaatgataa ctgaagcttc attttaccta 
agaaatataa agattttaat acctgccaca 
caagaatcat atgaaaaggc aaatgtcata 
gatccataca ccctacaata cagagggtgt 
cctaatttcc tactgaatga taacttaaca 
gtccatgaat gggcccacct ccgttggggt 
ttctacataa atgggcaaaa tcaaattaaa 
atttttgtgt gtgaaaaagg tccttgcccc 
aaagaaggat gcacctttat ctacaatagc 
atgcaaagtt tatcttctgt ggttgaattt 
ccaaacctac agaaccagat gtgcagcctc 
gctgactttc accacagctt tcccatgaac 
tcgcttgtag aggctggtga caaagtggtc 
gcagaggctg acagactcct tcaactacaa 
gttgaaattc ataccttcgt gggcattgcc 
cagctacacc aaattaacag caatgatgat 
actgtatcag ctaaaacaga catcagcatt 
gttgaaaaac tgaatggaaa agcttatggc 
gataagcttc ttggcaattg cttacccact 
attgccctgg gttcatctgc agccccaaat 
ttaaagttct ttgttccaga tatatcaaac 
atttcctctg gaactggaga cattttccag 
aatgtcaaac ctcaccatca attgaaaaac 
gacactatgt ttctagttac gtggcaggcc 
cctgatggac gaaaatacta cacaaataat 



gtacagcttc aagacaatgg gtataatgga 60 
gagaatcaga acctcatctc aaacattaag 120 
tttaatgcta ccaagagaag agtatttttc 180 
tggaaagcta ataataacag caaaataaaa 240 
gtgactgact ggtatggggc acatggagat 300 
ggaaaagagg gaaaatacat tcatttcaca 360 
gctggctacg gatcacgagg ccgagtgttt 420 
gtgttcgatg agtataacaa tgacaaacct 480 
gtgacaaggt gttcatctga catcacaggc 540 
caagaaaact gtattattag taagcttttt 600 
acccaaaatg caactgcatc aataatgttc 660 
tgtaatgcaa gtacccacaa ccaagaagca 720 
agaagtgcat gggatgtaat cacagactct 780 
gggactgagc ttccacctcc tcccacattc 840 
tgtttagtgc tggatgtgtc cagcaagatg 900 
caagccgcag aattttattt gatgcagatt 960 
agtttcgaca gcaaaggaga gatcagagcc 1020 
cgaaagttgc tggtttcata tctgcccacc 1080 
tgttcagggc ttaagaaagg atttgaggtg 1140 
tctgtgatga tattagtgac cagcggagat 1200 
gtgctcagca gtggttcaac aattcactcc 1260 
ctggaggaat tatcacgtct tacaggaggt 1320 
tccaatagca tgattgatgc tttcagtaga 1380 
caacatattc agcttgaaag tacaggtgaa 1440 
acagtgactg tggataatac tgtgggcaac 1500 
agtggtcctc ctgagattat attatttgat 1560 ' 
tttatcacca atctaacttt tcggacagct 1620 
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agtctttgga ttccaggaac agctaagcct gggcactgga cttacaccct gaacaatacc 1680 
catcattctc tgcaagccct gaaagtgaca gtgacctctc gcgcctccaa ctcagctgtg 1740 
cccccagcca ctgtggaagc ctttgtggaa agagacagcc tccattttcc tcatcctgtg 1800 
atgatttatg ccaatgtgaa acagggattt tatcccattc ttaatgccac tgtcactgcc 1860 
acagttgagc cagagactgg agatcctgtt acgctgagac tccttgatga tggagcaggt 1920 
gctgatgtta taaaaaatga tggaatttac tcgaggtatt ttttctcctt tgctgcaaat 1980 
ggtagatata gcttgaaagt gcatgtcaat cactctccca gcataagcac cccagcccac 2040 
tctattccag ggagtcatgc tatgtatgta ccaggttaca cagcaaacgg taatattcag 2100 
atgaatgctc caaggaaatc agtaggcaga aatgaggagg agcgaaagtg gggctttagc 2160 
cgagtcagct caggaggctc cttttcagtg ctgggagttc cagctggccc ccaccctgat 2220 
gtgtttccac catgcaaaat tattgacctg gaagctgtaa aagtagaaga ggaattgacc 2280 
ctatcttgga cagcacctgg agaagacttt gatcagggcc aggctacaag ctatgaaata 2340 
agaatgagta aaagtctaca gaatatccaa gatgacttta acaatgctat tttagtaaat 2400 
acatcaaagc gaaatcctca gcaagctggc atcagggaga tatttacgtt ctcaccccaa 2460 
atttccacga atggacctga acatcagcca aatggagaaa cacatgaaag ccacagaatt 2520 
tatgttgcaa tacgagcaat ggataggaac tccttacagt ctgctgtatc taacattgcc 2580 
caggcgcctc tgtttattcc ccccaattct gatcctgtac ctgccagaga ttatcttata 2640 
ttgaaaggag ttttaacagc aatgggtttg ataggaatca tttgccttat tatagttgtg 2700 
acacatcata ctttaagcag gaaaaagaga gcagacaaga aagagaatgg aacaaaatta 2760 
ttataatgaa ttc 2773 

<210> 359 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR primer 
<400> 359 

tggcagcccc tcttcttcaa gtggc 25 



<210> 360 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 360 

cgccagaatt catcaaacaa atctgttagc acc 

<210> 361 
<211> 77 
<212> PRT 

<213> Homo sapiens 



33 



<400> 361 

Met Gin His His His His His His Trp Gin Pro Leu Phe Phe Lys Trp 

15 10 15 

Leu Leu Ser Cys Cys Pro Gly Ser Ser Gin He Ala Ala Ala Ala Ser 

20 25 30 

Thr Gin Pro Glu Asp Asp He Asn Thr Gin Arg Lys Lys Ser Gin Glu 

35 40 45 

Lys Met Arg Glu Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu Thr 

50 55 60 

He Pro Gin Thr Ser Ser His Gly Ala Asn Arg Phe Val 
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65 70 75 



<210> 362 
<211> 244 
<212> DNA 

<213> Homo sapiens 
<400> 362 

catatgcagc atcaccacca tcaccactgg cagcccctct tcttcaagtg gctcttgtcc 60 
tgttgccctg ggagttctca aattgctgca gcagcctcca cccagcctga ggatgacatc 120 
aatacacaga ggaagaagag tcaggaaaag atgagagaag ttacagactc tcctgggcga 180 
ccccgagagc ttaccattcc tcagacttct tcacatggtg ctaacagatt tgtttgatga 240 
attc 244 

<210> 363 
<211> 20 

<212> PRT 

<213> Homo sapiens 
<400> 363 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

1 5 10 . 

Ser Ser Gin lie 
20 



<210> 364 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<400> 364 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 

<210> 365 
<211> 20 

<212> PRT 

<213> Homo sapiens 
<400> 365 

Gly Ser Ser Gin lie Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp 

15 10 15 

lie Asn Thr Gin 
20 



<210> 366 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<400> 366 

gggagttctc aaattgctgc agcagcctcc acccagcctg aggatgacat caatacacag 60 



<210> 367 



1 
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<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 367 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin Ala Leu Lys 
20 



<210> 368 

<211> 2343 

<212> DNA 

<213> Homo sapiens 

<400> 368 

attccggagc gtttgcggct tcgcttcatg gccgctctcc cgcccctcct gggatctgtg 60 
gggagctggg gagcccgcag cggcccggag ccggagctgg cgagccgagc ggagacctgt 120 
gcgccgcgcc tctgaggcgc agcatgtgaa gcggagacgg catccagtgg ggggcgagcc 180 
tctcagccgg ccgggatggc taccacggcc gagctcttcg aggagccttt tgtggcagat 240 
gaatatattg aacgtcttgt atggagaacc ccaggaggag gctctagagg tggacctgaa 300 
gcttttgatc ctaaaagatt attagaagaa tttgtaaatc atattcagga actccagata 360 
atggatgaaa ggattcagag gaaagtagag aaactagagc aacaatgtca gaaagaagcc 420 
aaggaatttg ccaagaaggt acaagagctg cagaaaagca atcaggttgc cttccaacat 480 
ttccaagaac tagatgagca cattagctat gtagcaacta aagtctgtca ccttggagac 540 
cagttagagg gggtaaacac acccagacaa cgggcagtgg aggctcagaa attgatgaaa 600 
tactttaatg agtttctaga tggagaattg aaatctgatg tttttacaaa ttctgaaaag 660 
ataaaggaag cagcagacat cattcagaag ttgcacctaa ttgcccaaga gttacctttt 720 
gatagatttt cagaagttaa atccaaaatt gcaagtaaat accatgattt agaatgccag 780 
ctgattcagg agtttaccag tgctcaaaga agaggtgaaa tctccagaat gagagaagta 840 
gcagcagttt tacttcattt taagggttat tcccattgtg ttgatgttta tataaagcag 900 
tgccaggagg gtgcttattt gagaaatgat atatttgaag acgctggaat actctgtcaa 960 
agagtgaaca aacaagttgg agatatcttc agtaatccag aaacagtcct ggctaaactt 1020 
attcaaaatg tatttgaaat caaactacag agttttgtga aagagcagtt agaagaatgt 1080 
aggaagtccg atgcagagca atatctcaaa aatctctatg atctgtatac aagaaccacc 1140 
aatctttcca gcaagctgat ggagtttaat ttaggtactg ataaacagac tttcttgtct 1200 
aagcttatca aatccatttt catttcctat ttggagaact atattgaggt ggagactgga 12 60 
tatttgaaaa gcagaagtgc tatgatccta cagcgctatt atgattcgaa aaaccatcaa 1320 
aagagatcca ttggcacagg aggtattcaa gatttgaagg aaagaattag acagcgtacc 1380 
aacttaccac ttgggccaag tat.cgatact catggggaga cttttctatc ccaagaagtg 1440 
gtggttaatc ttttacaaga aaccaaacaa gcctttgaaa gatgtcatag gctctctgat 1500 
ccttctgact taccaaggaa tgccttcaga atttttacca ttcttgtgga atttttatgt 1560 
attgagcata ttgattatgc tttggaaaca ggacttgctg gaattccctc ttcagattct 1620 
aggaatgcaa atctttattt tttggacgtt gtgcaacagg ccaatactat ttttcatctt 1680 
tttgacaaac agtttaatga tcaccttatg ccactaataa gctcttctcc taagttatct 1740 
gaatgccttc agaagaaaaa agaaataatt gaacaaatgg agatgaaatt ggatactggc 1800 
attgatagga cattaaattg tatgattgga cagatgaagc atattttggc tgcagaacag 1860 
aagaaaacag attttaagcc agaagatgaa aacaatgttt tgattcaata tactaatgcc 1920 
tgtgtaaaag tctgtgctta cgtaagaaaa caagtggaga agattaaaaa ttccatggat 1980 
gggaagaatg tggatacagt tttgatggaa cttggagtac gttttcatcg acttatctat 2040 
gagcatcttc aacaatattc ctacagttgt atgggtggca tgttggccat ttgtgatgta 2100 
gccgaatata ggaagtgtgc caaagacttc aagattccaa tggtattaca tctttttgat 2160 
actctgcatg ctctttgcaa tcttctggta gttgccccag ataatttaaa gcaagtctgc 2220 
tcaggagaac aacttgctaa tctggacaag aat atactic actccttcgt acaacttcgt 2280 
gctgattata gatctgcccg ccttgctcga cacttcagct gagattgaat ttacaaagga 2340 
att 2343 



wo 02/00174 



PCT/USOl/21065 



165 



<210> 369 
<211> 708 
<212> PRT 
<213>Homo sapiens 

<400> 369 

Met Ala Thr Thr Ala Glu Leu Phe Glu Glu Pro Phe Val Ala Asp Glu 

15 10 15 

Tyr lie Glu Arg Leu Val Trp Arg Thr Pro Gly Gly Gly Ser Arg Gly 

20 25 30 

Gly Pro Glu Ala Phe Asp Pro Lys Arg Leu Leu Glu Glu Phe Val Asn 

35 40 45 

His lie Gin Glu Leu Gin lie Met Asp Glu Arg He Gin Arg Lys Val 

50 55 60 

Glu Lys Leu Glu Gin Gin Cys Gin Lys Glu Ala Lys Glu Phe Ala Lys 
65 70 75 80 

Lys Val Gin Glu Leu Gin Lys Ser Asn Gin Val Ala Phe Gin His Phe 

85 90 95 

Gin Glu Leu Asp Glu His He Ser Tyr Val Ala Thr Lys Val Cys His 

100 105 110 

Leu Gly Asp Gin Leu Glu Gly Val Asn Thr Pro Arg Gin Arg Ala Val 

115 120 125 

Glu Ala Gin Lys Leu Met Lys Tyr Phe Asn Glu Phe Leu Asp Gly Glu 

130 135 140 

Leu Lys Ser Asp Val Phe Thr Asn Ser Glu Lys He Lys Glu Ala Ala 
145 150 155 160 

Asp He He Gin Lys Leu His Leu He Ala Gin Glu Leu Pro Phe Asp 

165 170 175 

Arg Phe Ser Glu Val Lys Ser Lys He Ala Ser Lys Tyr His Asp Leu 

180 185 190 

Glu Cys Gin Leu He Gin Glu Phe Thr Ser Ala Gin Arg Arg Gly Glu 

195 200 205 

He Ser Arg Met Arg Glu Val Ala Ala Val Leu Leu His Phe Lys Gly 

210 215 220 

Tyr Ser His Cys Val Asp Val Tyr He Lys Gin Cys Gin Glu Gly Ala 
225 230 235 240 

Tyr Leu Arg Asn Asp He Phe Glu Asp Ala Gly He Leu Cys Gin Arg 

245. 250 255 

Val Asn Lys Gin Val Gly Asp He Phe Ser Asn Pro Glu Thr Val Leu 

260 265 270 

Ala Lys Leu He Gin Asn Val Phe Glu He Lys Leu Gin Ser Phe Val 

275 280 285 

Lys Glu Gin Leu Glu Glu Cys Arg Lys Ser Asp Ala Glu Gin Tyr Leu 

290 295 300 

Lys Asn Leu Tyr Asp Leu Tyr Thr Arg Thr Thr Asn Leu Ser Ser Lys 
305 310 315 320 

Leu Met Glu Phe Asn Leu Gly Thr Asp Lys Gin Thr Phe Leu Ser Lys 

325 330 335 

Leu He Lys Ser He Phe He Ser Tyr Leu Glu Asn Tyr He Glu Val 

340 345 350 

Glu Thr Gly Tyr Leu Lys Ser Arg Ser Ala Met He Leu Gin Arg Tyr 

355 360 365 

Tyr Asp Ser Lys Asn His Gin Lys Arg Ser He Gly Thr Gly Gly He 

370 375 380 

Gin Asp Leu Lys Glu Arg He Arg Gin Arg Thr Asn Leu Pro Leu Gly 
385 390 395 400 

Pro Ser He Asp Thr His Gly Glu Thr Phe Leu Ser Gin Glu Val Val 
405 410 415 
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Val 


Asn 


Leu 


Leu Gin Glu 


Thr Lys Gin Ala 


Phe Glu Ara Cvs His 


Arg 








420 


425 


430 




Leu 


Ser 


Asp 


Pro Ser Asp 


Leu Pro Arg Asn 


Ala Phe Arcr Tie Phe 


Thr 






435 




440 


445 




lie 


Leu 


Val 


Glu Phe Leu 


Cys lie Glu His 


He Asn Tvr* Ala T»eii 


Glu 




450 






455 


460 




Thr 


Glv 


Leu 


Ala Gly lie 


Pro Ser Ser Asp 


Ser Arcr Asn Al a Asn 

vN^J. XU. V| X^«9XX X>XCI XmSXI 


Leu 


465 






470 




475 


480 


Tvr 


Phe 


Leu 


Asp Val Val 


Gin Gin Ala Asn 


Thr He Phe His Leu 


Phe 








485 


490 


495 




Asp 


Lvs 


Gin 


Phe Asn Asp 


His Leu Met Pro 


Leu He Ser Ser Ser 


Pro 








500 


505 


510 




Lvs 


Leu 


Ser 


Glu Cys Leu 


Gin Lys Lys Lys 


Glu He He Glu Gin 


Met 






515 




520 


525 




Glu 


Met 


Lys 


Leu Asp Thr 


Glv lie AsD Ara 


Thr Leu Asn Cvs Met 


He 




530 






535 


' 540 




Glv 


Gin 


Met 


Lys Hi s lie 


Leu Ala Ala Glii 


Gin Lvs Tivs Thr Asn 


Phe 


545 






550 




555 


560 


Lys 


Pro 


Glu 


Asp Glu Asn 


Asn Val Leu 11^ 


Gl n Tvr Thr Asn Ala 

WXXA X yX X XXX XIOXX XXXd 


Wjf o 








565 


570 


575 




Val 


Lys 


Val 


Cvs Ala Tvr 


Val Arcr Lvs Gin 

V M Jm y o «jxxi 


Val Rln TiVS Tie T.vq 

V dX V7X Ul J-J_y o XXvf UyO 


A cn 








580 


585 






Ser 


Met 


Asp 


Glv Lvs Asn 


Val Asn Thr Val 


i jc i_ vjxvi jj^u vjx_y 


V Cl X 






595 




600 


605 






Phe 


His 


y jjcu X X c 


TvY* mn PiQ T.011 
•Lyj. oxu xi^o Jjcu 


uxii ^xn xyx oex -i-yx 


Ser 




610 






615 


620 




Cys 


Met 


Gly 


Gly Met Leu 


Ala lie Cys Asp 


Val Ala Glu Tyr Arg 


Lys 


625 






630 




635 


640 


Cys 


Ala 


Lys 


Asp Phe Lys 


lie Pro Met Val 


Leu His Leu Phe Asp 


Thr 








645 


650 


655 




Leu 


His 


Ala 


Leu Cys Asn 


Leu Leu Val Val 


Ala Pro Asp Asn Leu 


Lys 








660 


665 


670 




Gin 


Val 


Cys 


Ser Gly Glu 


Gin Leu Ala Asn 


Leu Asp Lys Asn He 


Leu 






675 




680 


685 




His 


Ser 


Phe 


Val Gin Leu 


Arg Ala Asp Tyr 


Arg Ser Ala Arg Leu 


Ala 




690 






695 


700 




Arg 


His 


Phe 


Ser 









705 



<210> 370 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 370 

gtcaatcact ctcccagcat aagcacccca gcccactcta ttccagggag tcatgctatg 60 

<210> 371 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 371 

agtagaattt cctctggaac tggagacatt ttccagcaac atattcagct tgaaagtaca 60 
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<210> 372 

<211> 60 

<212> DMA 

<213> Homo sapiens 

<400> 372 

ccagagactg gagatcctgt tacgctgaga ctccttgatg atggagcagg tgctgatgtt 60 



<210> 373 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<400> 373 

ttacagtctg ctgtatctaa cattgcccag gcgcctctgt ttattccccc caattctgat 60 



<210> 374 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 374 

gctgtgcccc cagccactgt ggaagccttt gtggaaagag acagcctcca ttttcctcat 60 

<210> 375 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 375 

aaaaacacag tgactgtgga taatactgtg ggcaacgaca ctatgtttct agttacgtgg 60 



<210> 376 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 376 

Leu Gin Ser /Via Val Ser Asn He Ala Gin Ala Pro Leu Phe He Pro 

15 10 15 

Pro Asn Ser Asp 
20 



<210> 377 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 377 

Val Asn His Ser Pro Ser He Ser Thr Pro Ala His Ser He Pro Gly 

15 10 15 

Ser His Ala Met 
20 
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<210> 378 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 378 

Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

15 10 15 

Gly Ala Asp Val 
20 



<210> 379 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 379 

Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 

15 10 15 

His Phe Pro His 
20 



<210> 380 

<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 380 

Ser Arg lie Ser Ser Gly Thr Gly Asp He Phe Gin Gin His lie Gin 

15 10 15 

Leu Glu Ser Thr 
20 



<210> 381 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 381 

Lys Asn Thr Val Thr Val Asp Asn Thr Val Gly Asn Asp Thr Met Phe 

15 10 15 

Leu Val Thr Trp 
20 



<210> 382 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 382 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 
15 10 15 
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Gin Ala Leu Lys 
20 



<210> 383 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 383 

cggcgaattc atggattggg ggacgctgc 29 

<210> 384 
<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 384 

cggcctcgag tcacccctct atccgaacct tctgc 35 

<210> 385 
<21i> 32 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR primer 
<400> 385 

cggcgaattc cacgaaccac tcgcaagttc ag 32 

<210> 386 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 386 

eggctcgagt tagcttgggc ctgtgattgc . 30 

<210> 387 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 387 

Phe Phe Lys Trp Leu Leu Ser Gys Gys Pro Gly Ser Ser Gin lie Ala 

1- 5 10 15 

Ala Ala Ala Ser 
20 
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<210> 388 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 388 

Leu Ser Cys Cys Pro Gly Ser Ser Gin He Ala Ala Ala Ser Thr Gin 

15 10 15 

Pro Glu Asp 



<210> 389 
<2.11> 20 
<212> PRT 

<213> Homo sapiens 
<400> 389 

Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp He Asn Thr Gin Arg 

1 5 10 ' 15 

Lys Lys Ser Gin 
20 



<210> 390 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 390 

Thr Gin Pro Glu Asp Asp He Asn Thr Gin Arg Lys Lys Ser Gin Glu 

15 10 15 

Lys Met Arg Glu 
20 



<210> 391 
<211> 20 - 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Asp He Asn Thr Gin Arg Lys Lys 

1 5 
Thr Asp Ser Pro 
20 



Ser Gin Glu Lys Met Arg Glu Val 
10 15 



<210> 392 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 392 

Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly 
1 5 10 15 
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Arg Pro Arg Glu 
20 



<210> 393 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 393 

Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu 

15 10 15 

Thr He Pro Gin 
20 



<210> 394 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 394 

Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr 

1 5 10 15 

Ser Ser His Gly 
20 



<210> 395 
<211> 19 
<212> PRT 

<213> Homo sapiens 

<400> 395 

Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr Ser Ser His Gly Ala 

1.5 10 15 

Asn Arg Phe 



<210> 396 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 396 

Met Asn Lys Leu Tyr He Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 

15 10 15 

TVsp Leu Glu 



<210> 397 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 397 
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Ser Glu Asn Ala Ala Pro Ser Asp Leu Glu Ser lie Phe Lys Asp Ala 

15 10 15 

Lys lie Pro Val 
20 



<210> 398 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 398 

Ser lie Phe Lys Asp Ala Lys lie Pro Val Ser Gly Pro Phe Leu Val 

15 10 15 

Lys Thr Gly Tyr 
20 



<210> 399 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 399 

Ser Gly Pro Phe Leu Val Lys Thr 

1 5 
Asp Glu Ser Trp 
20 



Gly Tyr Ala Phe Val Asp Cys Pro 
10 15 



<210> 400 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 400 

Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys Ala He Glu 

1 5 10 15 

Ala Leu Ser Gly 
20 



<210> 401 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 401 

Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His Gly 

1 5 10 15 

Lys Pro He Glu 
20 



<210> 402 

<211> 20 

<212> PRT 

<213> Homo sapiens 
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<400> 402 

Lys lie Glu Leu His Gly Lys Pro lie Glu Val Glu His Ser Val Pro 

15 10 15 

Lys Arg Gin Arg 
20 



<210> 403 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 403 

Val Glu His Ser Val Pro Lys Arg Gin Arg He Arg Lys Leu Gin He 
15 10 15 

Arg Asn He Pro 
20 



<210> 404 
<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 404 

He Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu 

15 10 15 

Val Leu Asp Ser 
20 



<210> 405 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 405 

Ala Val Val Asn Val Thr Tyr Ser Ser Lys Asp Gin Ala Arg Gin Ala 

15 10 15 

Leu Asp Lys Leu 
20 



<210> 406 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 406 

Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu Glu 

1 5 10 15 

Asn Phe Thr Leu 
20 



<210> 407 
<211> 20 
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<212> PRT 

<213> Homo sapiens 

<400> 407 

Asn Gly Phe Gin Leu Glu Asn Phe Thr Leu Lys Val Ala Tyr lie Pro 

15 10 15 

Asp Glu Thr Ala 
20 



<210> 408 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 408 

Lys Val Ala Tyr lie Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu 

1 5 10 15 

Gin Gin Pro Arg 
20 



<210> 409 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 409 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 

15 10 15 

Gin Arg Gly Ser 
20 



<210> 410 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 410 

Gly Arg Arg Gly Leu Gly Gin Arg Gly Ser Ser Arg Gin Gly Ser Pro 

15 10 15 

Gly Ser Val Ser 
20 



<210> 411 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 411 

Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys Pro Cys Asp 

1 5 * 10 15 

Leu Pro Leu Arg 
20 
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<210> 412 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 412 

Lys Gin Lys Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin 

1 5-10 15 

Phe Val Gly Ala 
20 



<210> 413 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 413 

Leu Leu Val Pro Thr Gin Phe Val Gly Ala lie lie Gly Lys Glu Gly 

1-5 10 15 

Ala Thr He Arg 
20 



<210> 414 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 414 

He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin Thr 

15 10 15 

Gin Ser Lys He 
20 



<210> 415 
<211> 20 

<212> PRT 

<213> Homo sapiens 
<400> 415 

Asn He Thr Lys Gin Thr Gin Ser Lys He Asp Val His Arg Lys Glu 

15 10 15 

Asn Ala Gly Ala 
20 



<210> 416 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 416 

Asp Val His Arg Lys Glu Asn Ala 

1- 5 
He Leu Ser Thr 
20 



Gly Ala Ala Glu Lys Ser He Thr 
10 15 



wo 02/00174 



176 



PCTAJSOl/21065 



<210> 417 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 417 

Ala Glu Lys Ser lie Thr lie Leu Ser Thr Pro Glu Gly Thr Ser Ala 

15 10 15 

Ala Cys Lys Ser 
20 



<210> 418 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 418 

Pro Glu Gly Thr Ser Ala Ala Cys Lys Ser lie Leu Glu lie Met His 

1 5 . 10 15 

Lys Glu Ala Gin 
20 



<210> 419 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 419 

He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys Phe Thr Glu 

15 10 15 

Glu He Pro Leu 
20 



<210> 420 
<211> 455 
<212> DNA 
<213> Homo sapiens 

.<400> 420 

gaagacatgc ttacttcccc ttcaccttcc 
gccctagcca acgccgcatg agagggagtg 
catctagaaa gaagcgctta agatgtggca 
ttgccctggg agttctcaaa ttgctgcagc 
tacacagagg aagaagagtc aggaaaagat 
ccgagagctt accattcctc agacttcttc 
taaagctcta gaggccgtca aattggcaat 
acatgtttac aataatgagg agcaggttgg 

<210> 421 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



ttcatgatgt gggaagagtg ctgcaaccca 60 
tgccgagggc ttctgagaag gtttctctca 120 
gcccctcttc ttcaagtggc tcttgtcctg 180 
agcctccacc cagcctgagg atgacatcaa 240 
gagagaagtt acagactctc ctgggcgacc 300 
acatggtgct aacagatttg ttcctaaaag 360 
agaagccggg ttccaccata ttgattctgc 420 
actgg 455 
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<220> 

<223> PGR primer 
<400> 421 

actagtgtcc gcgtggcggc ctac 24 

<210> 422 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 422 

catgagaatt catcacatgc ccttgaaggc tccc 34 

•<210> 423 
<211> 161 
<212> PRT 

<213> Homo sapiens 



<400> 423 



Met 


Gin 


His His His 


His 


His 


His 


His 


Thr Ser Val Arg Val Ala Ala 


1 




5 










10 15 


Tyr 


Phe 


Glu Asn Phe 


Leu 


Ala 


Ala 


Trp 


Arg Pro Val Lys Ala Ser Asp 






20 








25 


30 


Gly 


Asp 


Tyr Tyr Thr 


Leu 


Ala 


Val 


Pro 


Met Gly Asp Val Pro Met Asp 






35 






40 




45 


Gly 


He 


Ser Val Ala 


Asp 


He 


Gly 


Ala 


Ala Val Ser Ser He Phe Asn 




50 






55 






60 


Ser 


Pro 


Glu Glu Phe 


Leu 


Gly 


Lys 


Ala 


Val Gly Leu Ser Ala Glu Ala 


65 






70 








75 80 


Leu 


Thr 


He Gin Gin 


Tyr 


Ala 


Asp 


Val 


Leu Ser Lys Ala Leu Gly Lys 






85 










90 95 


Glu 


Val 


Arg Asp Ala 


Lys 


He 


Thr 


Pro 


Glu Ala Phe Glu Lys Leu Gly 






100 








105 


110 


Phe 


Pro 


Ala Ala Lys 


Glu 


He 


Ala 


Asn 


Met Cys Arg Phe Tyr Glu Met 






115 






120 




125 


Lys 


Pro 


Asp Arg Asp 


Val 


Asn 


Leu 


Thr 


His Gin Leu Asn Pro Lys Val 




13d 






135 






140 


Lys 


Ser 


Phe Ser Gin 


Phe 


He 


Ser 


Glu 


Asn Gin Gly Ala Phe Lys Gly 


145 






150 








155 160 



Met 



<210> 424 
<211> 489 
<212> DNA 

<213> Homo sapiens 
<400> 424 

atgcagcatc accaccatca ccaccacact 
tttctcgcgg cgtggcggcc cgtgaaagcc 
ccgatgggag atgtaccaat ggatggtatc 
agcattttta attctccaga ggaattttta 
ctaacaatac agcaatatgc tgatgttttg 



agtgtccgcg tggcggccta ctttgaaaac 60 
tctgatggag attactacac cttggctgta 120 
tctgttgctg atattggagc agccgtctct 180 
ggcaaggccg tggggctcag tgcagaagca 240 
tccaaggctt tggggaaaga agtccgagat 300 
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gcaaagatta ccccggaagc tttcgagaag ctgggattcc ctgcagcaaa ggaaatagcc 360 
aatatgtgtc gtttctatga aatgaagcca gaccgagatg tcaatctcac ccaccaacta 420 
aatcccaaag tcaaaagctt cagccagttt atctcagaga accagggagc cttcaagggc 480 
atgtgatga 

<210> 425 

<211> 32 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR primer 
<400> 425 

aacaaactgt atatcggaaa cctcagcgag aa 32 

<210> 426 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 426 

ccatagaatt cattacttcc gtcttgactg agg 33 

<210> 427 

<211> 586 

<212> PRT 

<213> Homo sapiens 



<400> 427 



Met 


Gin 


His 


His 


His 


His 


His 


His 


Asn 


Lys Leu Tyr He Gly Asn Leu 


1 








5 










10 15 


Ser 


Glu 


Asn 


Ala 


Ala 


Pro 


Ser Asp 


Leu 


Glu Ser He Phe Lys Asp Ala 








20 










25 


30 


Lys 


lie 


Pro 


Val 


Ser 


Gly 


Pro 


Phe 


Leu 


Val Lys Thr Gly Tyr Ala Phe 




35 










40 




45 


Val 


Asp 


Cys 


Pro 


Asp 


Glu 


Ser 


Trp 


Ala 


Leu Lys Ala He Glu Ala Leu 




50 










55 






60 


Ser 


Gly 


Lys 


lie 


Glu 


Leu 


His Gly 


Lys 


Pro He Glu Val Glu His Ser 


65 






70 








75 80 


Val 


Pro 


Lys 


Arg 


Gin 


Arg 


He, 


Arg 


Lys 


Leu Gin He Arg Asn He Pro 






85 








90 95 


Pro 


His 


Leu 


Gin 


Trp 


Glu 


Val 


Leu 


Asp 


Ser Leu Leu Val Gin Tyr Gly 








100 










105 


110 


Val 


Val 


Glu 


Ser 


Cys 


Glu 


Gin 


Val 


Asn 


Thr Asp Ser Glu Thr TVla Val 






115 








120 




125 


Val 


Asn 


Val 


Thr 


Tyr 


Ser 


Ser 


Lys 


Asp 


Gin Ala Arg Gin Ala Leu Asp 




130 










135 






140 


Lys 


Leu 


Asn 


Gly 


Phe 


Gin 


Leu 


Glu 


Asn 


Phe Thr Leu Lys Val Ala Tyr 


145 








150 








155 160 


lie 


Pro 


Asp 


Glu 


Thr 


Ala 


Ala 


Gin 


Gin 


Asn Pro Leu Gin Gin Pro Arg 








165 










170 175 


Gly 


Arg 


Arg 


Gly 


Leu 


Gly 


Gin 


Arg 


Gly 


Ser Ser Arg Gin Gly Ser Pro 




180 










185 


190 


Gly 


Ser 


Val 


Ser 


Lys 


Gin 


Lys 


Pro 


Cys 


Asp Leu Pro Leu Arg Leu Leu 
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195 






200 






205 




Val 


Pro 


Thr 


Gin 


Phe 


Val Gly Ala 


He 


He Gly Lys 


Glu Gly Ala 


Thr 




210 








215 




220 






He 


Arg 


Asn 


He 


Thr Lys Gin Thr 


Gin 


Ser Lys He 


Asp Val His 


Arg 


225 










230 




235 




240 


Lys Glu 


Asn 


Ala 


Gly Ala Ala Glu 


Lys 


Ser He Thr 


He Leu Ser 


Thr 










245 






250 


255 




Pro 


Glu 


Gly 


Thr 


Ser 


Ala Ala Cys 


Lys 


Ser He Leu 


Glu He Met 


His 








260 






265 




270 




Lys 


Glu 


Ala 


Gin 


Asp He Lys Phe 


Thr 


Glu Glu He 


Pro Leu Lys 


He 






275 






280 






285 




Leu 


Ala 


His 


Asn 


Asn 


Phe Val Gly 


Arg 


Leu He Gly 


Lys Glu Gly 


Arg 




290 








.295 




300 






Asn 


Leu 


Lys 


Lys 


He Glu Gin Asp 


Thr 


Asp Thr Lys 


He Thr He 


Ser 


305 










310 




315 




320 


Pro 


Leu 


Gin 


Glu 


Leu 


Thr Leu Tyr 


Asn 


Pro Glu Arg 


Thr He Thr 


Val 










325 






330 


335 




Lys 


Gly 


Asn 


Val 


Glu 


Thr Cys Ala 


Lys 


Ala Glu Glu 


Glu He Met 


Lys 








340 






345 




350 




Lys 


He. 


Arg 


Glu 


Ser 


Tyr Glu Asn 


Asp 


He Ala Ser 


Met Asn Leu 


Gin 






355 






360 






365 




Ala 


His 


Leu 


He 


Pro Gly Leu Asn 


Leu 


Asn Ala Leu 


Gly Leu Phe 


Pro 




370 








375 




380 






Pro 


Thr 


Ser 


Gly 


Met 


Pro Pro Pro 


Thr 


Ser Gly Pro 


Pro Ser Ala 


Met 


385 










390 




395 




400 


Thr 


Pro 


Pro 


Tyr 


Pro 


Gin Phe Glu 


Gin 


Ser Glu Thr 


Glu Thr Val 


His 










405 






410 


415 




Leu 


Phe 


He 


Pro 


Ala 


Leu Ser Val 


Gly 


Ala He He 


Gly Lys Gin 


Gly 








420 






425 




J 30 




Gin 


His 


He 


Lys 


Gin 


Leu Ser Arg 


Phe 


Ala Gly Ala 


Ser He Lys 


He 






435 






440 






445 




Ala 


Pro 


Ala 


Glu 


Ala 


Pro Asp Ala 


Lys 


Val Arg Met 


Val He He 


Thr 




450 








455 




460 






Gly 


Pro 


Pro 


Glu 


Ala 


Gin Phe Lys 


Ala 


Gin Gly Arg 


He Tyr Gly 


Lys 


465 










470 




475 




480 


He 


Lys 


Glu 


Glu 


Asn 


Phe Val Ser 


Pro 


Lys Glu Glu 


Val Lys Leu 


Glu 










485 






490 


495 




Ala 


His 


He 


Arg 


Val 


Pro Ser Phe 


Ala 


Ala Gly Arg 


Val He Gly 


Lys 








500 






505 




510 




Gly Gly 


Lys 


Thr 


Val 


Asn Glu Leu 


Gin 


Asn Leu Ser 


Ser Ala Glu 


Val 






515 






520 






525 




Val 


Val 


Pro 


Arg 


Asp Gin Thr Pro 


Asp 


Glu Asn Asp 


Gin Val Val 


Val 




530 








535 




540 






Lys 


He 


Thr 


Gly 


His Phe Tyr Ala 


Cys 


Gin Val Ala 


Gin Arg Lys 


He 


545 










550 




555 




560 


Gin 


Glu 


He 


Leu 


Thr 


Gin Val Lys 


Gin 


His Gin Gin 


Gin Lys Ala 


Leu 










565 






570 


575 




Gin 


Ser 


Gly 


Pro 


Pro 


Gin Ser Arg 


Arg 


Lys 












580 






585 









<210> 428 

<211> 1764 

<212> DNA 

<213> Homo sapiens 

<400> 428 

atgcagcatc accaccatca ccacaacaaa ctgtatatcg gaaacctcag cgagaacgcc 60 
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gccccctcgg acctagaaag tatcttcaag 
ctggtgaaga ctggctacgc gttcgtggac 
atcgaggcgc tttcaggtaa aatagaactg 
gtcccaaaaa ggcaaaggat tcggaaactt 
tgggaggtgc tggatagttt actagtccag 
aacactgact cggaaactgc agttgtaaat 
caagcactag acaaactgaa tggatttcag 
atccctgatg aaacggccgc ccagcaaaac 
cttgggcaga ggggctcctc aaggcagggg 
tgtgatttgc ctctgcgcct gctggttccc 
gaaggtgcca ccattcggaa catcaccaaa 
aaagaaaatg cgggggctgc tgagaagtcg 
tctgcggctt gtaagtctat tctggagatt 
acagaagaga tccccttgaa gattttagct 
aaagaaggaa gaaatcttaa aaaaattgag 
ccattgcagg aattgacgct gtataatcca 
gagacatgtg ccaaagctga ggaggagatc 
gatattgctt ctatgaatct tcaagcacat 
ggtctgttcc cacccacttc agggatgcca 
actcctccct acccgcagtt tgagcaatca 
gctctatcag tcggtgccat catcggcaag 
tttgctggag cttcaattaa gattgctcca 
gtgattatca ctggaccacc agaggctcag 
attaaagaag aaaactttgt tagtcctaaa 
gtgccatcct ttgctgctgg cagagttatt 
cagaatttgt caagtgcaga agttgttgtc 
caagtggttg tcaaaataac tggtcacttc 
caggaaattc tgactcaggt aaagcagcac 
cctcagtcaa gacggaagta atga 



gacgccaaga tcccggtgtc gggacccttc 120 
tgcccggacg agagctgggc cctcaaggcc 180 
cacgggaaac ccatagaagt tgagcactcg 240 
cagatacgaa atatcccgcc tcatttacag 300 
tatggagtgg tggagagctg tgagcaagtg 360 
gtaacctatt ccagtaagga ccaagctaga 420 
ttagagaatt tcaccttgaa agtagcctat 480 
cccttgcagc agccccgagg tcgccggggg 540 
tctccaggat ccgtatccaa gcagaaacca 600 
acccaatttg ttggagccat cataggaaaa 660 
cagacccagt ctaaaatcga tgtccaccgt 720 
attactatcc tctctactcc tgaaggcacc 780 
atgcataagg aagctcaaga tataaaattc 840 
cataataact ttgttggacg tcttattggt 900 
caagacacag acactaaaat cacgatatct 960 
gaacgcacta ttacagttaa aggcaatgtt 1020 
atgaagaaaa tcagggagtc ttatgaaaat 1080 
ttaattcctg gattaaatct gaacgccttg 114 0 
cctcccacct cagggccccc ttcagccatg 1200 
gaaacggaga ctgttcatct gtttatccca 1260 
cagggccagc acatcaagca gctttctcgc 1320 
gcggaagcac cagatgctaa agtgaggatg 1380 
ttcaaggctc agggaagaat ttatggaaaa 1440 
gaagaggtga aacttgaagc tcatatcaga 1500 
ggaaaaggag gcaaaacggt gaatgaactt 1560 
cctcgtgacc agacacctga tgagaatgac 1620 
tatgcttgcc aggttgccca gagaaaaatt 1680 
caacaacaga aggctctgca aagtggacca 1740 

1764 



<210> 429 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PGR primer 



<400> 429. 

ccatggaatt cattatttca atataagata atctc 35 

<210> 430 
<211> 881 
<212> PRT 
<213> Homo sapiens 



<400> 430 



Met 


Gin 


His His His 


His 


His 


His 


1 




5 








Tyr Asn 


Gly Leu Leu 


He 


Ala 


He 






20 








Asn 


Leu 


He Ser Asn 


He 


Lys 


Glu 






35 






40 


Leu 


Phe 


Asn Ala Thr 


Lys 


Urg 


Arg 




50 






55 




Leu 


lie 


Pro Ala Thr 


Trp Lys 


Ala 


65 






70 






Glu 


Ser 


Tyr Glu Lys 


Ala 


Asn 


Val 



Gly Val Gin Leu Gin Asp Asn Gly 

10 15 
Asn Pro Gin Val Pro Glu Asn Gin 
25 30 
Met He Thr Glu Ala Ser Phe Tyr 
45 

Val Phe Phe Arg Asn He Lys He 
60 

Asn Asn Asn Ser Lys He Lys Gin 

75 80 
He Val Thr Asp Trp Tyr Gly Ala 
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85 90 95 

His Gly Asp Asp Pro Tyr Thr Leu Gin Tyr Arg Gly Cys Gly Lys Glu 

100 105 110 

Gly Lys Tyr lie His Phe Thr Pro Asn Phe Leu Leu Asn Asp Asn Leu 

115 120 125 

Thr Ala Gly Tyr Gly Ser Arg Gly Arg Val Phe Val His Glu Trp Ala 

130 135 140 

His Leu Arg Tip Gly Val Phe Asp Glu Tyr Asn Asn Asp Lys Pro Phe 
145 150 155 160 

Tyr He Asn Gly Gin Asn Gin He Lys Val Thr Arg Cys Ser Ser Asp 

165 170 175 

He Thr Gly He Phe Val Cys Glu Lys Gly Pro Cys Pro Gin Glu Asn 

180 185 190 

Cys He He Ser Lys Leu Phe Lys Glu Gly Cys Thr Phe He Tyr Asn 

195 200 205 

Ser Thr Gin Asn Ala Thr Ala Ser He Met Phe Met Gin Ser Leu Ser 

210 215 220 

Ser Val Val Glu Phe Cys Asn Ala Ser Thr His Asn Gin Glu Ala Pro 
225 230 235 240 

Asn Leu Gin Asn Gin Met Cys Ser Leu Arg Ser Ala Trp Asp Val He 

245 250 255 

Thr Asp Ser Ala Asp Phe His His Ser Phe Pro Met Asn Gly Thr Glu 

260 265 270 

Leu Pro Pro Pro Pro Thr Phe Ser Leu Val Glu Ala Gly Asp Lys Val 

275 280 285 

Val Cys Leu Val Leu Asp Val Ser Ser Lys Met Ala Glu Ala Asp Arg 

290 295 300 

Leu Leu Gin Leu Gin Gin Ala Ala Glu Phe Tyr Leu Met Gin He Val 
305 310 315 320 

Glu He His Thr Phe Val Gly He Ala Ser Phe Asp Ser Lys Gly Glu 

325 330 335 

He Arg Ala Gin Leu His Gin He Asn Ser Asn Asp Asp Arg Lys Leu 

340 345 350 

Leu Val Ser Tyr Leu Pro Thr Thr Val Ser Ala Lys Thr Asp He Ser 

355 360 365 

He Cys Ser Gly Leu Lys Lys Gly Phe Glu Val Val Glu Lys Leu Asn 

370 375 380 

Gly Lys Ala Tyr Gly Ser Val Met He Leu Val Thr Ser Gly Asp Asp 
385 390 395 400 

Lys Leu Leu Gly Asn Cys Leu Pro Thr Val Leu Ser Ser Gly Ser Thr 

405 410 415 

He His Ser He Ala Leu Gly Ser Ser Ala Ala Pro Asn Leu Glu Glu 

420 425 430 

Leu Ser Arg Leu Thr Gly Gly Leu Lys Phe Phe Val Pro Asp He Ser 

435 440 445 

Asn Ser Asn Ser Met He Asp Ala Phe Ser Arg He Ser Ser Gly Thr 

450 455 460 

Gly Asp He Phe Gin Gin His He Gin Leu Glu Ser Thr Gly Glu Asn 
465 470 475 480 

Val Lys Pro His His Gin Leu Lys Asn Thr Val Thr Val Asp Asn Thr 

485 ' 490 495 

Val Gly Asn Asp Thr Met Phe Leu Val Thr Trp Gin Ala Ser Gly Pro 

500 505 . 510 

Pro Glu He He Leu Phe Asp Pro Asp Gly Arg Lys Tyr Tyr Thr Asn 

515 520 525 

Asn Phe He Thr Asn Leu Thr Phe Arg Thr Ala Ser Leu Trp He Pro 

530 535 540 

Gly Thr Ala Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His 
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545 



His 


Ser 


Leu 


Gin 


Ser 


Ala 


Val 


Pro 








580 


Leu 


His 


Phe 


Pro 






595 




Phe 


Tyr 


Pro 


He 




610 






Thr 


Gly 


Asp 


Pro 


625 








Asp 


Val 


lie 


Lys 


Ala 


Ala 


Asn 


Gly 








660 


Ser 


He 


Ser 


Thr 






675 




Val 


Pro 


Gly 


Tyr 




690 






Lys 


Ser 


Val 


Gly 


705 








Val 


Ser 


Ser 


Gly 


His 


Pro 


Asp 


Val 








740 


Lvs 

J; ^ 


Val 


Glu 


Glu 






755 




Phe 


Asp 


Gin 


Glv 




770 






Leu 


Gin 


Asn 


lie 


785 








Ser 


Lys 


Arg 


Asn 


Ser 


Pro 


Gin 


He 








820 


Thr 


His 


Glu 


Ser 






835 




Asn 


Ser 


Leu 


Gin 




850 






He 


Pro 


Pro 


Asn 



865 





550 






Ala 


Leu 


Lys 


Val 


565 








Pro 


Ala 


Thr 


vai 


His 


Pro 


Val 


Met 








600 


Leu 


Asn 


Ala 


Thr 






615 




Val 


Thr 


Leu 


Arg 




630 






Asn Asp 


Gly 


He 


645 








Arg 


Tyr 


Ser 


Leu 


Pro 


Ala 


His 


Ser 








680 


Thr 


Ala 


Asn 


Gly 






695 




Arg Asn 


Glu 


Glu 




710 






Gly Ser 


Phe 


Ser 


725 








Phe 


Pro 


Pro 


Cys 


Glu 


Leu 


Thr 


Leu 








760 


Gin 


Ala 


Thr 


Ser 






775 




Gin Asp 


Asp 


Phe 




790 






Pro 


Gin 


Gin 


Ala 


805 








Ser 


Thr 


Asn 


Gly 


His 


Arg 


He 


Tyr 








840 


Ser 


Ala 


Val 


Ser 






855 




Ser Asp 


Pro 


Val 




870 











555 




Thr 


Val 


Thr 


Ser 




570 






Glu 


Ala 


Phe 


Val 


585 








He 


Tyr 


Ala 


Asn 


Val 


Thr 


Ala 


Thr 








620 


Leu 


Leu 


Asp 


Asp 






635 




Tyr 


Ser 


Arg 


Tyr 




650 






Lys 


Val 


His 


Val 


665 








He 


Pro 


Gly 


Ser 


Asn 


He 


Gin 


Met 








700 


Glu 


Ara 


Lvs 


Trp 






715 




Val 


Leu 


Glv 


Val 




730 






Lys 


He 


He 


Asp 


745 








Ser 


Trp 


Thr 


Ala 


Tvr 


Glu 


He 


Ara 








780 


Asn 


Asn 


Ala 


He 






795 




Gly 


He 


Arg 


Glu 




810 






Pro 


Glu 


His 


Gin 


825 








Val 


Ala 


He 


Arg 


Asn 


He 


Ala 


Gin 








860 


Pro 


Ala 


Arg 


Asp 






875 





560 



Ara 


Ala 


Ser 


Asn 






575 




Glu 


Ara 


Asp 


Ser 




590 






Val 


Lys 


Gin 


Gly 


605 








Val 


Glu 


Pro 


Glu 


Gly 


Ala 


Gly 


Ala 








640 


Phe 


Phe 


Ser 


Phe 






655 




Asn 


His 


Ser 


Pro 




670 






His 


Ala 


Met 


Tyr 


685 








Asn 


Ala 


Pro 


Arg 


Glv 


Phe 


Ser 


Arg 








720 


Pro 


Ala 


Glv 


Pro 






735 




Leu 


Glu 


Ala' 


Val 




750 






Pro 


Gly 


Glu 


Asp 


765 








Met 


Ser 


Lys 


Ser 


Leu 


Val 


Asn 


Thr 








800 


He 


Phe 


Thr 


Phe 






815 




Pro 


Asn 


Gly 


Glu 




830 






Ala 


Met 


Asp 


Arg 


845 








Ala 


Pro 


Leu 


Phe 


Tyr 


Leu 


He 


Leu 



880 



<210> 431 

<211> 2646 

<212> DNA 

<213> Homo sapiens 

<400> 431 

atgcagcatc accaccatca 
ctcattgcaa ttaatcctca 
atgataactg aagcttcatt 
aatataaaga ttttaatacc 
gaatcatatg aaaaggcaaa 
ccatacaccc tacaatacag 
aatttcctac tgaatgataa 



ccacggagta cagcttcaag 
ggtacctgag aatcagaacc 
ttacctattt aatgctacca 
tgccacatgg aaagctaata 
tgtcatagtg actgactggt 
agggtgtgga aaagagggaa 
cttaacagct ggctacggat 



acaatgggta taatggattg 60 
tcatctcaaa cattaaggaa 120 
agagaagagt atttttcaga 180 
ataacagcaa aataaaacaa 240 
atggggcaca tggagatgat 300 
aatacattca tttcacacct 360 
cacgaggccg agtgtttgtc 420 
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catgaatggg 

tacataaatg 
tttgtgtgtg 
gaaggatgca 
caaagtttat 
aacctacaga 
gactttcacc 
cttgtagagg 
gaggctgaca 
gaaattcata 
ctacaccaaa 
gtatcagcta 
gaaaaactga 
aagcttcttg 
gccctgggtt 
aagttctttg 
tcctctggaa 
gtcaaacctc 
actatgtttc 
gatggacgaa 
ctttggattc 
cattctctgc 
ccagccactg 
atttatgcca 
gttgagccag 
gatgttataa 
agatatagct 
attccaggga 
aatgctccaa 
gtcagctcag 
tttccaccat 
tcttggacag 
atgagtaaaa 
tcaaagcgaa 
tccacgaatg 
gttgcaatac 
gcgcctctgt 
aaataa 



cccacctccg 

ggcaaaatca 
aaaaaggtcc 
cctttatcta 
cttctgtggt 
accagatgtg 
acagctttcc 
ctggtgacaa 
gactccttca 
ccttcgtggg 
ttaacagcaa 
aaacagacat 
atggaaaagc 
gcaattgctt 
catctgcagc 
ttccagatat 
ctggagacat 
accatcaatt 
tagttacgtg 
aatactacac 
caggaacagc 
aagccctgaa 
tggaagcctt 
atgtgaaaca 
agactggaga 
aaaatgatgg 
tgaaagtgca 
gtcatgctat 
ggaaatcagt 
gaggctcctt 
gcaaaattat 
cacctggaga 
gtctacagaa 
atcctcagca 
gacctgaaca 
gagcaatgga 
ttattccccc 



ttggggtgtg 
aattaaagtg 
ttgcccccaa 
caatagcacc 
tgaattttgt 
cagcctcaga 
catgaacggg 
agtggtctgt 
actacaacaa 
cattgccagt 
tgatgatcga 
cagcatttgt 
ttatggctct 
acccactgtg 
cccaaatctg 
atcaaactcc 
tttccagcaa 
gaaaaacaca 
gcaggccagt 
aaataatttt 
taagcctggg 
agtgacagtg 
tgtggaaaga 
gggattttat 
tcctgttacg 
aatttactcg 
tgtcaatcac 
gtatgtacca 
aggcagaaat 
ttcagtgctg 
tgacctggaa 
agactttgat 
tatccaagat 
agctggcatc 
tcagccaaat 
taggaactcc 
caattctgat 



ttcgatgagt 

acaaggtgtt 
gaaaactgta 
caaaatgcaa 
aatgcaagta 
agtgcatggg 
actgagcttc 
ttagtgctgg 
gccgcagaat 
ttcgacagca 
aagttgctgg 
tcagggctta 
gtgatgatat 
ctcagcagtg 
gaggaattat 
aatagcatga 
catattcagc 
gtgactgtgg 
ggtcctcctg 
atcaccaatc 
cactggactt 
acctctcgcg 
gacagcctcc 
cccattctta 
ctgagactcc 
aggtattttt 
tctcccagca 
ggttacacag 
gaggaggagc 
ggagttccag 
gctgtaaaagi 
cagggccagg 
gactttaaca 
agggagatat 
ggagaaacac 
ttacagtctg 
cctgtacctg 



ataacaatga 

catctgacat 
ttattagtaa 
ctgcatcaat 
cccacaacca 
atgtaatcac 
cacctcctcc 
atgtgtccag 
tttatttgat 
aaggagagat 
tttcatatct 
agaaaggatt 
tagtgaccag 
gttcaacaat 
cacgtcttac 
ttgatgcttt 
ttgaaagtac 
ataatactgt 
agattatatt 
taacttttcg 
acaccctgaa 
cctccaactc 
attttcctca 
atgccactgt 
ttgatgatgg 
tctcctttgc 
taagcacccc 
caaacggtaa 
gaaagtgggg 
ctggccccca 
tagaagagga 
ctacaagcta 
atgctatttt 
ttacgttctc 
atgaaagcca 
ctgtatctaa 
ccagagatta. 



caaacctttc 

cacaggcatt 
gctttttaaa 
aatgttcatg 
agaagcacca 
agactctgct 
cacattctcg 
caagatggca 
gcagattgtt 
cagagcccag 
gcccaccact 
tgaggtggtt 
cggagatgat 
tcactccatt 
aggaggttta 
cagtagaatt 
aggtgaaaat 
gggcaacgac 
atttgatcct 
gacagctagt 
caatacccat 
agctgtgccc 
tcctgtgatg 
cactgccaca 
agcaggtgct 
tgcaaatggt 
agcccactct 
tattcagatg 
ctttagccga 
ccctgatgtg 
attgacccta 
tgaaataaga 
agtaaataca 
accccaaatt 
cagaatttat 
cattgcccag 
tcttatattg 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2646 



<210> 432 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 432 

cgcctgctcg agtcattaat attcatcaga aaatgg 

<210> 433 
<211> 371 

<212> PRT 

<213> Homo sapiens 
<400> 433 

Met Gin His His His .His His His Trp Gin Pro Leu Phe Phe Lys Trp 
15 10 15 



36 
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Leu 


Leu 


Ser 


Cys 


Cys Pro Gly Ser Ser Gin He 


Ala Ala Ala Ala Ser 








20 


25 


30 


Thr 


Gin 


Pro 


Glu 


Asp Asp He Asn Thr Gin Arg 


Lys Lys Ser Gin Glu 






35 




40 


45 


Lys 


Met 


Arg 


Glu 


Val Thr Asp Ser Pro Gly Arg 


Pro Arg Glu Leu Thr 




50 






55 


60 


He 


Pro 


Gin 


Thr 


Ser Ser His Gly Ala Asn Arg 


Phe Val Pro Lys Ser 


65 








70 ' 75 


80 


Lys 


Ala 


Leu 


Glu 


Ala Val Lys Leu Ala He Glu 


Ala Gly Phe His His 










85 90 


95 


He 


Asp 


Ser 


Ala 


His Val Tyr Asn Asn Glu Glu 


Gin Val Gly Leu Ala 








100 


105 


110 


He 


Arg 


Ser 


Lys 


He Ala Asp Gly Ser Val Lys 


Arg Glu Asp He Phe 






115 




120 


125 


Tyr 


Thr 


Ser 


Lys 


Leu Trp Ser Asn Ser His Arg 


Pro Glu Leu Val Arg 




130 






135 


140 


Pro 


Ala 


Leu 


Glu 


Arg Ser Leu Lys Asn Leu Gin 


Leu Asp Tyr Val Asp 


145 








150 155 


160 


Leu 


Tyr 


Leu 


He 


His Phe Pro Val Ser Val Lys 


Pro Gly Glu Glu Val 










165 170 


175 


He 


Pro 


Lvs 


Asp 


Glu Asn Gly Lys He Leu Phe 


Asp Thr Val Asp Leu 








180 


185 


190 


Cvs 


Ala 


Thr 


Trp 


Glu Ala Met Glu Lys Cys Lys 


Asp Ala Gly Leu Ala 






195 




200 


205 


Lvs 


Ser 


He 


Gly 


Val Ser Asn Phe Asn His Arg 


Leu Leu Glu Met He 




210 






215 


220 


Leu 


Asn 


Lvs 


Pro 


Gly Leu Lys Tyr Lys Pro Val 


Cys Asn Gin Val Glu 


225 








230 235 


240 


Cvs 

Jr 


His 


Pro 


Tvr 

Jr 


Phe Asn Gin Arg Lys Leu Leu 


Asp Phe Cys Lys Ser 










245 250 


255 


Lvs 


Asp 


He 


Val 


Leu Val Ala Tyr Ser Ala Leu 


Gly Ser His Arg Glu 








260 


265 


270 


Glu 


Pro 


Trp 


Val 


Asp Pro Asn Ser Pro Val Leu 


Leu Glu Asp Pro Val 






275 




280 


285 


Leu 


Cvs 


Ala 


Leu 


Ala Lys Lys His Lys Arg Thr 


Pro Ala Leu He Ala 




290 






295 


300 


Leu 


Arg 


Tyr 


Gin 


Leu Gin Arg Gly Val Val Val 


Leu Ala Lys Ser Tyr 


305 








310 315 


320 


Asn 


Glu 


Gin 


Arg 


He Arg Gin Asn Val Gin Val 


Phe Glu Phe Gin Leu 










325 330 


335 


Thr 


Ser 


Glu 


Glu 


Met Lys Ala He Asp Gly Leu 


Asn Arg Asn Val Arg 








340 


345 


350- 


Tyr 


Leu 


Thr 


Leu 


Asp He Phe Ala Gly Pro Pro 


Asn Tyr Pro Phe Ser 






355 




360 


365 


Asp 


Glu 


Tyr 










370 











<210> 434 
<211> 1119 
<212> DNA 

<213> Homo sapiens 
<400> 434 

atgcagcatc accaccatca ccactggcag cccctcttct 
tgccctggga gttctcaaat tgctgcagca gcctccaccc 
acacagagga agaagagtca ggaaaagatg agagaagtta 
cgagagctta ccattcctca gacttcttca catggtgcta 



tcaagtggct cttgtcctgt 60 

agcctgagga tgacatcaat 120 

cagactctcc tgggcgaccc 180 

acagatttgt tcctaaaagt 240 
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aaagctctag 
catgtttaca 
sgtgtgaaga 
gagttggtcc 
ctctatctta 
gaaaatggaa 
aagtgtaaag 
ctggagatga 
tgtcatcctt 
ctggttgcct 
ccggtgctct 
gccctgattg 
aatgagcagc 
atgaaagcca 
ggccccccta 



aggccgtcaa 
ataatgagga 
gagaagacat 
gaccagcctt 
ttcattttcc 
aaatactatt 
atgcaggatt 
tcctcaacaa 
acttcaacca 
atagtgctct 
tggaggaccc 
ccctgcgcta 
gcatcagaca 
tagatggcct 
attatccatt 



attggcaata 

gcaggttgga 

attctacact 

ggaaaggtca 

agtgtctgta* 

tgacacagtg 

ggccaagtcc 

gccagggctc 

gagaaaactg 

gggatcccat 

agtcctttgt 

ccagctgcag 

gaacgtgcag 

aaacagaaat 

ttctgatgaa 



gaagccgggt 
ctggccatcc 
tcaaagcttt 
ctgaaaaatc 
aagccaggtg 
gatctctgtg 
atcggggtgt 
aagtacaagc 
ctggatttct 
cgagaagaac 
gccttggcaa 
cgtggggttg 
gtgtttgaat 
gtgcgatatt 
tattaatga 



tccaccatat 
gaagcaagat 
ggagcaattc 
ttcaattgga 
aggaagtgat 
ccacatggga 
ccaacttcaa 
ctgtctgcaa 
gcaagtcaaa 
catgggtgga 
aaaagcacaa 
tggtcctggc 
tccagttgac 
tgacccttga 



tgattctgca 
tgcagatggc 
ccatcgacca 
ctatgttgac 
cccaaaagat 
ggccatggag 
ccacaggctg 
ccaggtggaa 
agacattgtt 
cccgaactcc 
gcgaacccca 
caagagctac 
ttcagaggag 
tatttttgct 



<210> 435 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 435 

ggatccgccg ccaccatgac atccattcga gctgta 

<210> 436 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 436 

gtcgactcag ctggaccaca gccgcag 

<210> 437 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1119 



36 



27 



<400> 437 

ggatccgccg ccaccatgga ctcctggacc ttctgct 

<210> 438 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



37 



<400> 438 
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gtcgactcag aaatcctttc tcttgac 



27 



<210> 439 
<211> 933 
<212> DNA 
<213> Homo sapiens 



<400> 439 

atggactcct 

gctggagtta 

agatgtaaac 

ggactggagt 

gaggatcgat 

tcagaaccca 

gaagctttct 

ccacccgagg 

acactggtgt 

aatgggaagg 

gccctcaatg 

cagaaccccc 

gagtggaccc 

agagcagact 

ctctatgaga 

ttgatggcca 



ggaccttctg 
tccagtcacc 
caatttcagg 
tgctcattta 
tctcagctaa 
gggactcagc 
ttggacaagg 
tcgctgtgtt 
gcctggccac 
aggtgcacag 
actccagata 
gcaaccactt 
aggatagggc 
gtggctttac 
tcctgctagg 
tggtcaagag 



ctgtgtgtcc 
ccggcacgag 
acacgactac 
ctttaacaac 
gatgcctaat 
tgtgtacttc 
caccagactc 
tgagccatca 
aggcttcttc 
tggggtcagc 
ctgcctgagc 
ccgctgtcaa 
caaacccgtc 
ctcggtgtcc 
gaaggccacc 
aaaggatttc 



ctttgcatcc 
gtgacagaga 
cttttctggt 
aacgttccga 
gcatcattct 
tgtgccagca 
acagttgtag 
gaagcagaga 
cctgaccacg 
acggacccgc 
agccgcctga 
gtccagttct 
acccagatcg 
taccagcaag 
ctgtatgctg 
tga 



tggtagcaaa 
tgggacaaga 
acagacagac 
tagatgattc 
ccactctgaa 
gtttagttgg 
aggacctgaa 
tctcccacac 
tggagctgag 
agcccctcaa 
gggtctcggc 
acgggctctc 
tcagcgccga 
gggtcctgtc 
tgctggtcag 



gcacacagat 
agtgactctg 
catgatgcgg 
agggatgccc 
gatccagccc 
agcaaacact 
caaggtgttc 
ccaaaaggcc 
ctggtgggtg 
ggagcagccc 
caccttctgg 
ggagaatgac 
ggcctggggt 
tgccaccatc 
cgcccttgtg 



<210> 440 
<211> 822 
<212> DNA 
<213> Homo sapiens 



<400> 440 

atgacatcca 

gagaatgtgg 

aagtgtactt 

aaaagacctc 

attgctgtta 

cctgaagact 

tattttggga 

gtgtaccagc 

gattctcaaa 

.gtgctagaca 

tctgactttg 

cccagcccag 

aacctaaact 

gggtttaatc 



ttcgagctgt 
agcagcatcc 
attcagacag 
agcttattat 
cattgaacaa 
cggctgtcta 
cagggacaag 
tgagagactc 
caaatgtgtc 
tgaggtctat 
catgtgcaaa 
aaagttcctg 
ttcaaaacct 
tgctcatgac 



atttatattc 
ttcaaccctg 
tgcctcaaac 
agacattcgt 
gacagccaaa 
cttctgtgca 
tttgacggtc 
taaatccagt 
acaaagtaag 
ggacttcaag 
cgccttcaac 
tgatgtcaag 
gtcagtgatt 
gctgcggctg 



ctgtggctgc 
agtgtccagg 
tacttccctt 
tcaaatgtgg 
catttctccc 
gcaagtatac 
attccaaata 
gacaagtctg 
gattctgatg 
agcaacagtg 
aacagcatta 
ctggtcgaga 
gggttccgaa 
tggtccagct 



agctggactt 
agggagacag 
ggtataagca 
gcgaaaagaa 
tgcacatcac 
tgaacaccgg 
tccagaaccc 
tctgcctatt 
tgtatatcac 
ctgtggcctg 
ttccagaaga 
aaagctttga 
tcctcctcct 
ga 



ggtgaatgga 
cgctgttatc 
agaacttgga 
agaccaacga 
agagacccaa 
taaccagttc 
tgaccctgcc 
caccgatttt 
agacaaaact 
gagcaacaaa 
caccttcttc 
aacagatacg 
gaaagtggcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
933 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
822 



<210> 441 

<211> 2311 

<212> DNA 

<213> Homo sapiens 



<400> 441 

gatttaatcc 

aagagttggt 

gtctccctgt 

aaccgcccag 

aaacactcca 



tatgacaaac 
gtttgctcag 
tctgtcctag 
agtagaagat 
ccagcattgg 



taagttggtt 
gaagagattt 
ctatgttcct 
ggattggggc 
aaagatctgg 



ctgtcttcac 
aagcatgctt 
gtgttgtgtg 
acgctgcaga 
ctcaccgtcc 



ctgttttggt 
gcttacccag 
cattcgtctt 
cgatcctggg 
tcttcatttt 



gaggttgtgt 60 
actcagagaa 120 
ttccagagca 180 
gggtgtgaac 240 
tcgcattatg 300 
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atcctcgttg tggctgcaaa ggaggtgtgg ggagatgagc aggccgactt tgtctgcaac 360 
accctgcagc caggctgcaa gaacgtgtgc tacgatcact acttccccat ctcccacatc 420 
cggctatggg ccctgcagct gatcttcgtg tccagcccag cgctcctagt ggccatgcac 480 
gtggcctacc ggagacatga gaagaagagg aagttcatca agggggagat aaagagtgaa 540 
tttaaggaca tcgaggagat caaaacccag aaggtccgca tcgaaggctc cctgtggtgg 600 
acctacacaa gcagcatctt cttccgggtc atcttcgaag ccgccttcat gtacgtcttc 660 
tatgtcatgt acgacggctt ctccatgcag cggctggtga agtgcaacgc ctggccttgt 720 
cccaacactg tggactgctt tgtgtcccgg cccacggaga agactgtctt cacagtgttc 780 
atgattgcag tgtctggaat ttgcatcctg ctgaatgtca ctgaattgtg ttatttgcta 840 
attagatatt gttctgggaa gtcaaaaaag ccagtttaac gcattgccca gttgttagat 900 
taagaaatag acagcatgag agggatgagg caacccgtgc tcagctgtca aggctcagtc 960 
gccagcattt cccaacacaa agattctgac cttaaatgca accatttgaa acccctgtag 1020 
gcctcaggtg aaactccaga tgccacaatg agctctgctc ccctaaagcc tcaaaacaaa 1080 
ggcctaattc tatgcctgtc ttaattttct ttcacttaag ttagttccac tgagacccca 1140 
ggctgttagg ggttattggt gtaaggtact ttcatatttt aaacagagga tatcggcatt 1200 
tgtttctttc tctgaggaca agagaaaaaa gccaggttcc acagaggaca cagagaaggt 1260 
ttgggtgtcc tcctggggtt ctttttgcca actttcccca cgttaaaggt gaacattggt 1320 
tctttcattt gctttggaag ttttaatctc taacagtgga caaagttacc agtgccttaa 1380 
actctgttac actttttgga agtgaaaact ttgtagtatg ataggttatt ttgatgtaaa 1440 
gatgttctgg ataccattat atgttccccc tgtttcagag gctcagattg taatatgtaa 1500 
atggtatgtc attcgctact atgatttaat ttgaaatatg gtcttttggt tatgaatact 1560 
ttgcagcaca gctgagagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 1620 
attgtagcct caatcgagtg agacagacta gaagttccta gttggcttat gatagcaaat 1680 
ggcctcatgt caaatattag atgtaatttt gtgtaagaaa tacagactgg atgtaccacc 1740 
aactactacc tgtaatgaca ggcctgtcca acacatctcc cttttccatg ctgtggtagc 1800 
cagcatcgga aagaacgctg at'ttaaagag gtgagcttgg gaattttatt gacacagtac 1860 
catttaatgg ggagacaaaa atgggggcca ggggagggag aagtttctgt cgttaaaaac 1920 
gagtttggaa agactggact ctaaattctg ttgattaaag atgagctttg tctaccttca 1980 
aaagtttgtt tggcttaccc ccttcagcct ccaatttttt aagtgaaaat ataactaata 2040 
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2100 
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatact 2160 
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2220 
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 2280 
atttaaaaca ttaaaatata atctctataa t 2311 

<210> 442 
<211> 226 
<212> PRT 
<213> Homo sapiens 

<400> 442 

Met Asp Trp Gly Thr Leu Gin Thr He Leu Gly Gly Val Asn Lys His 
5 10 15 

Ser Thr Ser He Gly Lys He Trp Leu Thr Val Leu Phe He Phe Arg 
20 25 30 

He Met He Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gin 
35 40 45 

Ala Asp Phe Val Cys Asn Thr Leu Gin Pro Gly Cys Lys Asn Val Cys 
50 55 60 

Tyr Asp His Tyr Phe Pro He Ser His He Arg Leu Trp Ala Leu Gin 
65 70 75 80 

Leu He Phe Val Ser Ser Pro Ala Leu Leu Val Ala Met His Val Ala 
85 90 95 
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Tyr Arg Arg His Glu Lys Lys Arg Lys Phe He hys Gly Glu He Lys 
100 105 110 

Ser Glu Phe Lys Asp He Glu Glu He Lys Thr Gin Lys Val Arg He 
115 120 125 

Glu Gly Ser Leu Trp Tarp Thr Tyr Thr Ser Ser He Phe Phe Arg Val 
130 135 140 

He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly 
145 150 155 160 

Phe Ser Met Gin Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn 
165 170 175 

Thr Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr 
180 185 190 

Val Phe Met He Ala Val Ser Gly He Cys He Leu Leu Asn Val Thr 
195 200 205 



Glu Leu Cys Tyr Leu Leu He Arg Tyr Cys Ser Gly Lys Ser Lys Lys 
210 215 220 



Pro Val 
225 



<210> 443 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 443 

Val Lys Leu Cys Gly He Asp Pro Cys Pro Asn Leu Val Asp Cys Phe 
5 10 15 

He Ser Arg Pro Gly Cys Gly 
20 



<210> 444 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 444 

caatcaggca tgcacaacaa actgtatatc ggaaac 36 



<210> 445 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> PGR primer 

<400> 445 

cgtcaagatc ttcattactt ccgtcttgac 30 

<210> 446 
<211> 579 

<212> PRT 

<213> Homo sapiens 
<400> 446 

Met Asn Lys Leu Tyr He Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 
5 10 15 

Asp Leu Glu Ser He Phe Lys Asp Ala Lys He Pro Val Ser Gly Pro 
20 25 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 
35 40 45 

Trp Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His 
50 55 60 

Gly Lys Pro He Glu Val Glu His Ser Val Pro Lys Arg Gin Arg He 
65 70 75 80 

Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu Val 
85 90 95 

Leu Asp Ser Leu Leu Val Gin Tyr Gly Val Val Glu Ser Cys Glu Gin 
100 105 110 

Val Asn Thr Asp Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Ser 
115 120 125 

Lys Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu 

130 135 140 

Glu Asn Phe Thr Leu Lys Val Ala Tyr He Pro Asp Glu Thr Ala Ala 
145 150 155 160 

Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly Gin 
165 170 175 

Arg Gly Ser Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys 
180 185 190 

Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin Phe Val Gly 
195 200 205 

Ala He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin 
210 215 220 

Thr Gin Ser Lys He Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala 
225 230 235 240 
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Glu Lys Ser lie Thr lie Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 
245 250 255 

Cys Lys Ser lie Leu Glu. lie Met His Lys Glu Ala Gin Asp lie Lys 
260 265 270 

Phe Thr Glu Glu lie Pro Leu Lys lie Leu Ala His Asn Asn Phe Val 
275 280 285 

Gly Arg Leu He Gly Lys Glu Gly Arg Asn Leu Lys Lys He Glu Gin 
290 295 300 

Asp Thr Asp Thr Lys He Thr lie Ser Pro Leu Gin Glu Leu Thr Leu 
305 310 315 320 

Tyr Asn Pro Glu Arg Thr He. Thr Val Lys Gly Asn Val Glu Thr Cys 
325 330 335 

Ala Lys Ala Glu Glu Glu He Met Lys Lys He Arg Glu Ser Tyr Glu 
340 345 350 

Asn Asp He Ala Ser Met Asn Leu Gin Ala His Leu He Pro Gly Leu 
355 360 365 

Asn Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro 
370 375 380 

Pro Thr Ser Gly Pro Pro Ser Ala Met Thr Pro Pro Tyr Pro Gin Phe 
385 390 395 400 

Glu Gin Ser Glu Thr Glu Thr Val His Leu Phe He Pro Ala Leu Ser 
405 410 415 

Val Gly Ala He He Gly Lys Gin Gly Gin His He Lys Gin Leu Ser 
420 425 430 

Arg Phe Ala Gly Ala Ser He Lys He Ala Pro Ala Glu Ala Pro Asp 

435 440 445 

Ala Lys Val Arg Met Val He He Thr Gly Pro Pro Glu Ala Gin Phe 
450 455 460 

Lys Ala Gin Gly Arg He Tyr Gly Lys He Lys Glu Glu Asn Phe Val 
465 470 475 480 

Ser Pro Lys Glu Glu Val Lys Leu Glu Ala His He Arg Val Pro Ser 

485 490 495 

Phe Ala Ala Gly Arg Val He Gly Lys Gly Gly Lys Thr Val Asn Glu 
500 505 510 

Leu Gin Asn Leu Ser Ser Ala Glu Val Val Val Pro Arg Asp Gin Thr 
515 520 525 

Pro Asp Glu Asn Asp Gin Val Val Val Lys He Thr Gly His Phe Tyr 
530 535 540 



Ala Cys Gin Val Ala Gin Arg Lys He Gin Glu He Leu Thr Gin Val 
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545 550 555 560 

Lys Gin His Gin Gin Gin Lys Ala Leu Gin Ser Gly Pro Pro Gin Ser 
565 570 575 

Arg Arg Lys 



<210> 447 

<211> 1743 

<212> DNA 

<213> Homo sapiens 

<400> 447 

atgaacaaac tgtatatcgg aaacctcagc 
atcttcaagg acgccaagat cccggtgtcg 
ttcgtggact gcccggacga gagctgggcc 
atagaactgc acgggaaacc catagaagtt 
cggaaacttc agatacgaaa tatcccgcct 
ctagtccagt atggagtggt ggagagctgt 
gttgtaaatg taacctattc cagtaaggac 
ggatttcagt tagagaattt caccttgaaa 
cagcaaaacc ccttgcagca gccccgaggt 
aggcaggggt ctccaggatc cgtatccaag 
ctggttccca cccaatttgt tggagccatc 
atcaccaaac agacccagtc taaaatcgat 
gagaagtcga ttactatcct ctctactcct 
ctggagatta tgcataagga agctcaagat 
attttagctc ataataactt tgttggacgt 
aaaattgagc aagacacaga cactaaaatc 
tataatccag aacgcactat tacagttaaa 
gaggagatca tgaagaaaat cagggagtct 
caagcacatt taattcctgg attaaatctg 
gggatgccac ctcccacctc agggccccct 
gagcaatcag aaacggagac tgttcatctg 
atcggcaagc agggccagca catcaagcag 
attgctccag cggaagcacc agatgctaaa 
gaggctcagt tcaaggctca gggaagaatt 
agtcctaaag aagaggtgaa acttgaagct 
agagttattg gaaaaggagg caaaacggtg 
gttgttgtcc ctcgtgacca gacacctgat 
ggtcacttct atgcttgcca ggttgcccag 
aagcagcacc aacaacagaa ggctctgcaa 
tga 



<210> 448 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR primer 



gagaacgccg ccccctcgga cctagaaagt 60 
ggacccttcc tggtgaagac tggctacgcg 120 
ctcaaggcca tcgaggcgct ttcaggtaaa 180 
gagcactcgg tcccaaaaag gcaaaggatt 240 
catttacagt gggaggtgct ggatagttta 300 
gagcaagtga acactgactc ggaaactgca 360 
caagctagac aagcactaga caaactgaat 420 
gtagcctata tccctgatga aacggccgcc 480 
cgccgggggc ttgggcagag gggctcctca 540 
cagaaaccat gtgatttgcc tctgcgcctg 600 
ataggaaaag aaggtgccac cattcggaac 660 
gtccaccgta aagaaaatgc gggggctgct 720 
gaaggcacct ctgcggcttg taagtctatt 780 
ataaaattca cagaagagat ccccttgaag 840 
cttattggta aagaaggaag aaatcttaaa 900 
acgatatctc cattgcagga attgacgctg 960 
ggcaatgttg agacatgtgc caaagctgag 1020 
tatgaaaatg atattgcttc tatgaatctt 1080 
aacgccttgg gtctgttccc acccacttca 1140 
tcagccatga ctcctcccta cccgcagttt 1200 
tttatcccag ctctatcagt cggtgccatc 1260 
ctttctcgct ttgctggagc ttcaattaag 1320 
gtgaggatgg tgattatcac tggaccacca 1380 
tatggaaaaa ttaaagaaga aaactttgtt 1440 
catatcagag tgccatcctt tgctgctggc 1500 
aatgaacttc agaatttgtc aagtgcagaa 1560 
gagaatgacc aagtggttgt caaaataact 1620 
agaaaaattc aggaaattct gactcaggta 1680 
agtggaccac ctcagtcaag acggaagtaa 1740 

1743 



<400> 448 

cgtactagca tatgaacaaa ctgtatatcg gaaac 



35 
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<210> 449 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 449 

Met Asn Lys Leu Tyr lie Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 
5 10 15 

Asp Leu Glu Ser lie Phe Lys Asp Ala Lys lie Pro Val Ser Gly Pro 
20 25 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 
35 40 45 

Trp Ala Leu Lys Ala lie Glu Ala Leu Ser Gly Lys lie Glu' Leu His 
50 55 60 

Gly Lys Pro He Glu Val Glu His Ser Val Pro Lys Arg Gin Arg He 
65 70 75 80 

Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Txp Glu Val 
85 90 95 

Leu Asp Ser Leu Leu Val Gin Tyr Gly Val Val Glu Ser Cys Glu Gin 
100 105 110 

Val Asn Thr Asp Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Ser 
115 120 125 

Lys Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu 
130 135 140 

Glu Asn Phe Thr Leu Lys Val Ala Tyr He Pro Asp Glu Thr Ala Ala 
145 150 155 160 

Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly Gin 
165 170 175 

Arg Gly Ser Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys 
180 185 190 

Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin Phe Val Gly 
195 200 205 

Ala He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin 

210 215 220 

Thr Gin Ser Lys He Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala 
225 230 235 240 

Glu Lys Ser He Thr He Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 
245 250 255 

Cys Lys Ser He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys 
260 265 270 

Phe Thr Glu Glu He Pro Leu Lys He Leu Ala His Asn Asn Phe Val 
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Gly Arg 
290 

Asp Thr 
305 

Tyr Asn 
Ala Lys 
Asn Asp 



275 

Leu He Gly 
Asp Thr Lys 



Pro Glu Arg 

325 

Ala Glu Glu 
340 

He Ala Ser 
355 



280 

Lys Glu Gly 
295 

He Thr He 
310 

Thr He Thr 



Glu He Met 



Met Asn Leu 
360 



Arg Asn 



Ser Pro 



Val Lys 
330 

Lys Lys 
345 



Leu Lys 
300 

Leu Gin 
315 

Gly Asn 
He Arg 



Gin Ala His Leu 



285 

Lys He Glu Gin 



Glu Leu Thr Leu 
320 

Val Glu Thr Cys 
335 

Glu Ser Tyr Glu 
350 

He Pro Gly Leu 
365 



Asn Leu 
370 

Pro Thr 
385 



Asn Ala Leu 



Ser Gly Pro 



Gly Leu Phe 
375 

Pro Ser Ala 
390 



Pro Pro 



Met Thr 



Thr Ser 
380 

Pro Pro 
395 



Gly Met Pro Pro 



Tyr Pro 



Gin Phe 
400 



Glu Gin 
Val Gly 
Arg Phe 



Ala Lys 
450 
Lys Ala 
465 

Ser Pro 



Phe Ala 

Leu Gin 

Pro Asp 
530 

Ala Cys 
545 

Lys Gin 
Arg Arg 



Ser Glu Thr 
405 

Ala He He 
420 

Ala Gly Ala 

435 

Val Arg Met 
Gin Gly Arg 

Lys Glu Glu 
485 

Ala Gly Arg 
500 

Asn Leu Ser 

515 

Glu Asn Asp 
Gin Val Ala 



His Gin Gin 
565 

Lys 



Glu Thr Val 
Gly Lys Gin 

Ser He Lys 

440 

Val He He 

455 
He Tyr Gly 
470 

Val Lys Leu 
Val He Gly 



Ser Ala Glu 

520 

Gin Val Val 
535 

Gin Arg Lys 
550 

Gin Lys Ala 



His Leu 
410 

Gly Gin 
425 



Phe He Pro Ala 



His He 



He Ala Pro Ala 



Lys Gin 
430 

Glu Ala 
445 



Leu Ser 
415 

Leu Ser 



Pro Asp 



Thr Gly 
Lys He 



Glu Ala 
490 



Pro Pro 
460 
Lys Glu 
475 



Glu Ala Gin Phe 
Glu Asn 



His He Arg Val 



Lys Gly 
505 



Gly Lys 
Val Val Val Pro 



Thr Val 
510 

Arg Asp 

525 



Phe Val 
480 

Pro Ser 
495 

Asn Glu 



Gin Thr 



Val Lys 
He Gin 



Leu Gin 
570 



He Thr 
540 

Glu He 
555 



Gly His Phe Tyr 



Leu Thr Gin Val 
560 



Ser Gly Pro Pro Gin Ser 
575 
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<210> 450 
<211> 1743 
<212> DNA 

<213> Homo sapiens 

<400> 450 

atgaacaaac tgtatatcgg aaacctcagc 
atcttcaagg acgccaagat cccggtgtcg 
ttcgtggact gcccggacga gagctgggcc 
atagaactgc acgggaaacc catagaagtt 
cggaaacttc agatacgaaa tatcccgcct 
ctagtccagt atggagtggt ggagagctgt 
gttgtaaatg taacctattc cagtaaggac 
ggatttcagt tagagaattt caccttgaaa 
cagcaaaacc ccttgcagca gccccgaggt 
aggcaggggt ctccaggatc cgtatccaag 
ctggttccca cccaatttgt tggagccatc 
atcaccaaac agacccagtc taaaatcgat 
gagaagtcga ttactatcct ctctactcct 
ctggagatta tgcataagga agctcaagat 
attttagctc ataataactt tgttggacgt 
aaaattgagc aagacacaga cactaaaatc 
tataatccag aacgcactat tacagttaaa 
gaggagatca tgaagaaaat cagggagtct 
caagcacatt taattcctgg attaaatctg 
gggatgccac ctcccacctc agggccccct 
gagcaatcag aaacggagac tgttcatctg 
atcggcaagc agggccagca catcaagcag 
attgctccag cggaagcacc agatgctaaa 
gaggctcagt tcaaggctca gggaagaatt 
agtcctaaag aagaggtgaa acttgaagct 
agagttattg gaaaaggagg caaaacggtg 
gttgttgtcc ctcgtgacca gacacctgat 
ggtcacttct atgcttgcca ggttgcccag 
aagcagcacc aacaacagaa ggctctgcaa 
tga 



gagaacgccg ccccctcgga cctagaaagt 60 
ggacccttcc tggtgaagac tggctacgcg 120 
ctcaaggcca tcgaggcgct ttcaggtaaa 180 
gagcactcgg tcccaaaaag gcaaaggatt 240 
catttacagt gggaggtgct ggatagttta 300 
gagcaagtga acactgactc ggaaactgca 360 
caagctagac aagcactaga caaactgaat 420 
gtagcctata tccctgatga aacggccgcc 480 
cgccgggggc ttgggcagag gggctcctca 540 
cagaaaccat gtgatttgcc tctgcgcctg 600 
ataggaaaag aaggtgccac cattcggaac 660 
gtccaccgta aagaaaatgc gggggctgct 720 
gaaggcacct ctgcggcttg taagtctatt 780 
ataaaattca cagaagagat ccccttgaag 840 
cttattggta aagaaggaag aaatcttaaa 900 
acgatatctc cattgcagga attgacgctg 960 
ggcaatgttg agacatgtgc caaagctgag 1020 
tatgaaaatg atattgcttc tatgaatctt 1080 
aacgccttgg gtctgttccc acccacttca 1140 
tcagccatga ctcctcccta cccgcagttt 1200 
tttatcccag ctctatcagt cggtgccatc 1260 
ctttctcgct ttgctggagc ttcaattaag 1320 
gtgaggatgg tgattatcac tggaccacca 1380 
tatggaaaaa ttaaagaaga aaactttgtt 1440 
catatcagag tgccatcctt tgctgctggc 1500 
aatgaacttc agaatttgtc aagtgcagaa 1560 
gagaatgacc aagtggttgt caaaataact 1620 
agaaaaattc aggaaattct gactcaggta 1680 
agtggaccac ctcagtcaag acggaagtaa 1740 

1743 



<210> 451 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 451 

Leu Gly Lys Glu Val Arg Asp Ala Lys He Thr Pro Glu Ala Phe Glu 
5 10 15 

Lys Leu Gly Phe Pro Ala Ala Lys Glu 
20 25 



<210> 452 
<211> 25 
<212> PRT 

<213> Homo sapiens 



<400> 452 



wo 02/00174 



PCTAJSOl/21065 



195 



Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val Pro Met Gly Asp 
5 10 is 

Val Pro Met Asp Gly He Ser Val Ala 
20 25 



<210> 453 

<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 453 

Pro Asp Arg Asp Val Asn Leu Thr His Gin Leu Asn Pro Lys Val Lys 
5 10 15 



<210> 454 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 454 

Lys He Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met Val He 
5 10 15 

He Thr Gly Pro 
20 



<210> 455 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 455 

Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly 
5 10 15 

Arg Arg Gly Leu 
20 



<210> 456 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 456 

Arg Thr He Thr Val Lys Gly Asn Val Glu Thr Cys Ala Lys Ala Glu 
5 10 15 

Glu Glu He Met 
20 



<210> 457 
<211> 20 
<212> PRT 
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<213> Homo sapiens 
<400> 457 

Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys Ala lie Glu 
5 10 15 

Ala Leu Ser Gly 
20 



<210> 458 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 458 

lie Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu 
5 10 15 

Val Leu Asp Ser 
20 



<210> 459 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 459 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 
5 10 15 

Gin Arg Gly Ser 
20 



<210> 460 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 460 

Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala Glu Lys Ser He Thr 
5 10 15 

He Leu Ser Thr 
20 



<210> 461 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 461 

Leu Tyr Asn Pro Glu Arg Thr He Thr Val Lys Gly Asn Val Glu Thr 
5 10 15 



wo 02/00174 PCT/USOl/21065 

197 



Cys Ala Lys Ala 
20 



<210> 462 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 462 

Glu Glu Glu lie Met Lys Lys lie Arg Glu Ser Tyr Glu Asn Asp lie 
5 10 15 

Ala Ser Met Asn 
20 



<210> 463 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 463 

Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro Pro 
5 10 15 

Thr Ser Gly Pro 
20 



<210> 464 

<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 464 

Lys lie Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met Val lie 
5 10 15 

He Thr Gly Pro 
20 

<210> 465 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 465 

Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys He 
5 10 15 

Glu 



<210> 466 
<211> 11 
<212> PRT 

<213> Homo sapiens 
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<400> 466 

Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu 
5 10 

<210> 467 

<211> 33 

<212> DNA 

<213> Homo sapiens 



<400> 467 

ttcgtggact gcccggacga gagctgggcc etc 



33 



